HUMAN RNA-ASSOCIATED PROTEINS



TECHNICAL FIELD 

This invention relates to nucleic acid and amino acid sequences of human RNA-associated 
proteins and to the use of these sequences in the diagnosis, treatment, and prevention of cell 
proliferative, autoimmune/inflammatory, and infectious disorders. 



Ribonucleic acid (RNA) is a linear single-stranded polymer of four nucleotides, ATP,
CTP, UTP, and GTP. In most organisms, RNA is transcribed as a copy of deoxyribonucleic acid
(DNA), the genetic material of the organism. In retroviruses, RNA rather than DNA serves as the
genetic material. RNA copies of the genetic material encode proteins or serve various structural,
catalytic, or regulatory roles in organisms. RNA is classified according to its cellular localization
and function. Messenger RNAs (mRNAs) encode polypeptides. Ribosomal RNAs (rRNAs) are
assembled, along with ribosomal proteins, into ribosomes, which are cytoplasmic particles that
translate mRNA into polypeptides. Transfer RNAs (tRNAs) are cytosolic adaptor molecules that
function in mRNA translation by recognizing both an mRNA codon and the amino acid that
matches that codon.

The unspliced precursors of mature mRNA transcripts are called heterogeneous nuclear 
RNA (hnRNA) transcripts. hnRNA is generally larger and more unstable than mRNA. 
Immediately upon its synthesis, hnRNA is assembled into protein-containing complexes called 
heterogeneous nuclear ribonucleoprotein particles (hnRNPs). (See, for example, Honore, B. et al. 
(1995) J. Biol. Chem. 270:28780-28789.) hnRNPs associate with small nuclear ribonucleoprotein
particles (snRNPs), which are stable RNA-protein complexes that function primarily in splicing
introns from hnRNA. Each snRNP contains a single species of RNA and about 10 proteins. The
RNA components of snRNPs recognize and base pair with specific sequences of the hnRNA 
intron. Five different snRNPs associate at the intron of hnRNA to form the spliceosome, a 
multicomponent RNP complex which catalyzes the removal of introns and the rejoining of exons. 
Also associated with the snRNPs are various accessory factors that stabilize intron-snRNP 
interactions. In humans, these factors include spliceosome associated protein 49 (SAP 49) and 
SAP 145. (Champion-Arnaud, P. and Reed, R. (1994) Genes Dev. 8:1974-1983.)

Proteins are associated with RNA during its transcription from DNA, RNA processing,
and translation of mRNA into protein. Proteins are also associated with RNA as it is used for
structural, catalytic, and regulatory purposes. RNA polymerases are proteins that transcribe RNA
from a DNA template. The HIV Tat protein binds specific sites in the viral RNA to prevent premature
transcriptional termination. Various proteins are necessary for processing of transcribed RNAs in
the nucleus. Pre-mRNA processing steps include capping at the 5' end with methylguanosine,
polyadenylating the 3' end, and splicing to remove introns. The spliceosomal complex is
composed of five small nuclear ribonucleoprotein particles (snRNPs) designated U1, U2, U4, U5,
and U6. Each snRNP contains a single species of snRNA and about ten proteins. The RNA
components of some snRNPs recognize and base pair with intron consensus sequences. The
protein components mediate spliceosome assembly and the splicing reaction. Autoantibodies to
snRNP proteins are found in the blood of patients with systemic lupus erythematosus (Stryer, L.
(1995) Biochemistry W.H. Freeman and Company, New York NY, p. 863).

The process of splicing may involve more than the removal of an intron from an RNA 
transcript. For example, an RNA transcript may be subject to alternative patterns of splicing, 
resulting in the generation of different species of mRNA from a single primary transcript. In
addition, certain transcripts may be subject to trans-splicing, in which an exon from one transcript
is joined to an exon of another. Often, specific protein cofactors are required to mediate splicing
under these special circumstances. For example, a new splicing factor, PR264, has been
implicated in the trans-splicing of a thymus-specific c-myb transcript in humans. (Vellard, M. et
al. (1992) Proc. Natl. Acad. Sci. USA 89:2511-2515.)

Heterogeneous nuclear ribonucleoproteins (hnRNPs) have been identified that have roles
in splicing, exporting of the mature RNAs to the cytoplasm, and mRNA translation (Biamonti, G.
et al. (1998) Clin. Exp. Rheumatol. 16:317-326). Some examples of hnRNPs include the yeast
proteins Hrp1p, involved in cleavage and polyadenylation at the 3' end of the RNA; Cbp80p,
involved in capping the 5' end of the RNA; and Npl3p, a homolog of mammalian hnRNP A1,
involved in export of mRNA from the nucleus (Shen, E.C. et al. (1998) Genes Dev. 12:679-691).
HnRNPs have been shown to be important targets of the autoimmune response in rheumatic
diseases (Biamonti, supra).

Nascent tRNA transcripts are spliced by unconventional mechanisms that are distinct from 
those employed by the spliceosome. In this case, splicing is carried out by specific endonucleases 
and ligases that recognize secondary structural features of the tRNA. This process contrasts with
the spliceosomal reaction, in which specific nucleotide sequences of the intron are recognized. In 
addition, tRNAs are further processed by removal of 5' sequences and by chemical modification of 
some of the nucleotide bases. tRNA processing has been extensively studied in yeast, in which 
tRNA-specific splicing factors have been identified. (See, for example, Shen, W. C. et al. (1993) 
J. Biol. Chem. 268:19436-19444.)

Proteins are also a part of the translation machinery of the cell. The eukaryotic ribosome
is composed of a 60S (large) subunit and a 40S (small) subunit, which together form the 80S
ribosome. In addition to the 18S, 28S, 5S, and 5.8S rRNAs, the ribosome also contains more than
fifty proteins. The ribosomal proteins have a prefix which denotes the subunit to which they
belong, either L (large) or S (small). Initiation factors, many of which contain multiple subunits,
are proteins which are involved in bringing together an initiator tRNA, the mRNA, and the
ribosomal 40S subunit. Eukaryotic initiation factor 2 (eIF2), a guanine nucleotide binding protein,
recruits the initiator tRNA to the 40S ribosomal subunit. Only when eIF2 is bound to GTP does it
associate with the initiator tRNA. eIF2B, a guanine nucleotide exchange protein, is responsible
for converting eIF2 from the GDP-bound inactive form to the GTP-bound active form. Other
initiation factors include eIF1A, eIF3, eIF4F (a complex including eIF4E, eIF4A, and eIF4G), and
eIF5. The elongation factors EF1α, EF1βγ, and EF2 are involved in elongating the polypeptide
chain following initiation, and the release factor eRF carries out termination of translation. (See
V. M. Pain (1996) Eur. J. Biochem. 236:747-771.)

Other important RNA-associated enzymes with roles in translation are the aminoacyl-transfer
RNA (tRNA) synthetases. Protein biosynthesis depends on each amino acid forming a
linkage with the appropriate tRNA. The aminoacyl-tRNA synthetases are responsible for the
activation and correct attachment of an amino acid to its cognate tRNA. The 20 aminoacyl-tRNA
synthetase enzymes can be divided into two structural classes, and each class is
characterized by a distinctive topology of the catalytic domain. Class I enzymes contain a
catalytic domain based on the nucleotide-binding Rossmann 'fold'. Class II enzymes contain a
central catalytic domain, which consists of a seven-stranded antiparallel β-sheet motif, as well as N-
and C-terminal regulatory domains. Class II enzymes are separated into two groups based on the
heterodimeric or homodimeric structure of the enzyme; the latter group is further subdivided by
the structure of the N- and C-terminal regulatory domains (Hartlein, M. and Cusack, S. (1995) J.
Mol. Evol. 40:519-530). Autoantibodies against aminoacyl-tRNA synthetases are generated by
patients with dermatomyositis and polymyositis, and correlate strongly with complicating
interstitial lung disease (ILD). These antibodies appear to be generated in response to viral
infection, and coxsackie virus has been used to induce experimental viral myositis in animals.

In many cases, mRNA translation, localization, and stability are controlled by regulatory 
proteins that bind to the 5' and 3' untranslated (UTR) regions of mRNA. An example of such a 
protein is Spnr, a mouse spermatid perinuclear RNA-binding protein, which may be involved in 
RNA transport, translational activation, or localization of RNA to cytoplasmic microtubules 
(Schumacher, J. M. et al. (1995) J. Cell Biol. 129:1023-1032). RNA-associated proteins may alter
and regulate RNA conformation and secondary structure. These processes are mediated by RNA
helicases, which utilize energy derived from ATP hydrolysis to destabilize and unwind RNA
duplexes. The best-characterized and most ubiquitous family of RNA helicases is the DEAD-box
family, so named for the conserved B-type ATP-binding motif which is diagnostic of proteins in
this family. Over 40 DEAD-box helicases have been identified in organisms as diverse as
bacteria, insects, yeast, amphibians, mammals, and plants. DEAD-box helicases function in
diverse processes such as translation initiation, splicing, ribosome assembly, and RNA editing,
transport, and stability. Some DEAD-box helicases play tissue- and stage-specific roles in
spermatogenesis and embryogenesis. All DEAD-box helicases contain several conserved
sequence motifs spread out over about 420 amino acids. These motifs include an A-type ATP
binding motif, the DEAD-box/B-type ATP-binding motif, a serine/arginine/threonine tripeptide of
unknown function, and a C-terminal glycine-rich motif with a possible role in substrate binding
and unwinding. In addition, alignment of divergent DEAD-box helicase sequences has shown that
37 amino acid residues are identical among these sequences, suggesting that conservation of these
residues is important for helicase function. (Reviewed in Linder, P. et al. (1989) Nature 337:121-
122.)
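
By way of illustration only, the presence of the DEAD-box tetrapeptide in a candidate protein sequence might be checked with a simple scan such as the Python sketch below. The sketch assumes the motif is the literal one-letter sequence D-E-A-D (Asp-Glu-Ala-Asp); the function name and the example sequence are hypothetical and are not taken from any sequence disclosed herein. A match to this tetrapeptide alone would not establish helicase function, since the other conserved motifs described above are not examined.

```python
def find_dead_box(protein):
    """Return the 1-based start positions of the tetrapeptide D-E-A-D.

    `protein` is a one-letter-code amino acid string. Only the literal
    DEAD tetrapeptide is searched; the other conserved helicase motifs
    are not considered in this sketch.
    """
    seq = protein.upper()
    motif = "DEAD"
    width = len(motif)
    return [i + 1 for i in range(len(seq) - width + 1) if seq[i:i + width] == motif]


if __name__ == "__main__":
    # Hypothetical sequence used only to exercise the function.
    example = "MKTDEADRLLDEADQ"
    print(find_dead_box(example))
```

Positions are reported 1-based, following the usual convention for numbering residues from the N-terminus.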

Overexpression of the DEAD-box 1 protein (DDX1) may play a role in the progression of
neuroblastoma (Nb) and retinoblastoma (Rb) tumors (Godbout, R. et al. (1998) J. Biol. Chem.
273:21161-21168). These observations suggest that DDX1 may promote or enhance tumor
progression by altering the normal secondary structure and expression levels of RNA in cancer
cells. Other DEAD-box helicases have been implicated either directly or indirectly in
tumorigenesis. (Discussed in Godbout, supra.) For example, murine p68 is mutated in ultraviolet
light-induced tumors, and human DDX6 is located at a chromosomal breakpoint associated with
B-cell lymphoma. Similarly, a chimeric protein comprised of DDX10 and NUP98, a nucleoporin
protein, may be involved in the pathogenesis of certain myeloid malignancies.

Ribonucleases (RNases) are RNA-associated enzymes which catalyze the hydrolysis of
phosphodiester bonds in RNA chains, thus cleaving the RNA. For example, RNase P is a
ribonucleoprotein enzyme which cleaves the 5' end of pre-tRNAs as part of their maturation
process. RNase H digests the RNA strand of an RNA/DNA hybrid. Such hybrids occur in cells
invaded by retroviruses, and RNase H is an important enzyme in the retroviral replication cycle.
An RNase H domain is often found associated with reverse transcriptases. RNase
activity in serum and cell extracts is elevated in a variety of cancers and infectious diseases
(Schein, C.H. (1997) Nat. Biotechnol. 15:529-536). Regulation of RNase activity is being
investigated as a means to control tumor angiogenesis, allergic reactions, viral infection and
replication, and fungal infections.

Many snRNP and hnRNP proteins are characterized by an RNA recognition motif (RRM).
(Reviewed in Birney, E. et al. (1993) Nucleic Acids Res. 21:5803-5816.) The RRM is about 80
amino acids in length and forms four β-strands and two α-helices arranged in an α/β sandwich.
The RRM contains a core RNP-1 octapeptide motif along with surrounding conserved sequences.
In addition to snRNP proteins, examples of RNA-binding proteins which contain the above motifs
include heteronuclear ribonucleoproteins which stabilize nascent RNA and factors which regulate
alternative splicing. Alternative splicing factors include developmentally regulated proteins which
have been identified in lower eukaryotes such as Drosophila melanogaster and Caenorhabditis
elegans. These proteins play key roles in developmental processes such as pattern formation and
sex determination, respectively. (See, for example, Hodgkin, J. et al. (1994) Development
120:3681-3689.) The RRM includes the ribonucleoprotein-1 (RNP-1) RNA binding motif which
is found in snRNP proteins, hnRNP proteins, splicing factors, mRNA binding proteins, and
transcriptional regulatory proteins. Other hallmarks of RNA binding proteins include regions of
repeated arginine and serine residues (RS repeats).

The discovery of new human RNA-associated proteins and the polynucleotides encoding
them satisfies a need in the art by providing new compositions which are useful in the diagnosis,
prevention, and treatment of cell proliferative, autoimmune/inflammatory, and infectious
disorders.



SUMMARY OF THE INVENTION 

The invention features substantially purified polypeptides, human RNA-associated
proteins, referred to collectively as "RNAAP" and individually as "RNAAP-1," "RNAAP-2,"
"RNAAP-3," "RNAAP-4," "RNAAP-5," "RNAAP-6," "RNAAP-7," "RNAAP-8," "RNAAP-9,"
"RNAAP-10," "RNAAP-11," "RNAAP-12," "RNAAP-13," "RNAAP-14," "RNAAP-15,"
"RNAAP-16," "RNAAP-17," "RNAAP-18," "RNAAP-19," "RNAAP-20," "RNAAP-21,"
"RNAAP-22," "RNAAP-23," "RNAAP-24," and "RNAAP-25." In one aspect, the invention
provides a substantially purified polypeptide comprising an amino acid sequence selected from the
group consisting of SEQ ID NO:1-25 and fragments thereof.

The invention further provides a substantially purified variant having at least 90% amino
acid identity to at least one of the amino acid sequences selected from the group consisting of SEQ
ID NO:1-25 and fragments thereof. The invention also provides an isolated and purified
polynucleotide encoding the polypeptide comprising an amino acid sequence selected from the
group consisting of SEQ ID NO:1-25 and fragments thereof. The invention also includes an
isolated and purified polynucleotide variant having at least 70% polynucleotide sequence identity
to the polynucleotide encoding the polypeptide comprising an amino acid sequence selected from
the group consisting of SEQ ID NO:1-25 and fragments thereof.

Additionally, the invention provides an isolated and purified polynucleotide which
hybridizes under stringent conditions to the polynucleotide encoding the polypeptide comprising
an amino acid sequence selected from the group consisting of SEQ ID NO:1-25 and fragments
thereof. The invention also provides an isolated and purified polynucleotide having a sequence
which is complementary to the polynucleotide encoding the polypeptide comprising the amino
acid sequence selected from the group consisting of SEQ ID NO:1-25 and fragments thereof.

The invention also provides a method for detecting a polynucleotide in a sample
containing nucleic acids, the method comprising the steps of (a) hybridizing the complement of the
polynucleotide sequence to at least one of the polynucleotides of the sample, thereby forming a
hybridization complex; and (b) detecting the hybridization complex, wherein the presence of the
hybridization complex correlates with the presence of a polynucleotide in the sample. In one
aspect, the method further comprises amplifying the polynucleotide prior to hybridization.

The invention also provides an isolated and purified polynucleotide comprising a
polynucleotide sequence selected from the group consisting of SEQ ID NO:26-50 and fragments
thereof. The invention further provides an isolated and purified polynucleotide variant having at
least 70% polynucleotide sequence identity to the polynucleotide sequence selected from the
group consisting of SEQ ID NO:26-50 and fragments thereof. The invention also provides an
isolated and purified polynucleotide having a sequence which is complementary to the
polynucleotide comprising a polynucleotide sequence selected from the group consisting of SEQ
ID NO:26-50 and fragments thereof.

The invention further provides an expression vector containing at least a fragment of the
polynucleotide encoding the polypeptide comprising an amino acid sequence selected from the
group consisting of SEQ ID NO:1-25 and fragments thereof. In another aspect, the expression
vector is contained within a host cell.

The invention also provides a method for producing a polypeptide, the method comprising
the steps of: (a) culturing the host cell containing an expression vector containing at least a
fragment of a polynucleotide under conditions suitable for the expression of the polypeptide; and
(b) recovering the polypeptide from the host cell culture.

The invention also provides a pharmaceutical composition comprising a substantially
purified polypeptide having the amino acid sequence selected from the group consisting of SEQ
ID NO:1-25 and fragments thereof, in conjunction with a suitable pharmaceutical carrier.

The invention further includes a purified antibody which binds to a polypeptide selected
from the group consisting of SEQ ID NO:1-25 and fragments thereof. The invention also provides
a purified agonist and a purified antagonist to the polypeptide.

The invention also provides a method for treating or preventing a disorder associated with
decreased expression or activity of RNAAP, the method comprising administering to a subject in
need of such treatment an effective amount of a pharmaceutical composition comprising a
substantially purified polypeptide having the amino acid sequence selected from the group
consisting of SEQ ID NO:1-25 and fragments thereof, in conjunction with a suitable
pharmaceutical carrier.

The invention also provides a method for treating or preventing a disorder associated with
increased expression or activity of RNAAP, the method comprising administering to a subject in
need of such treatment an effective amount of an antagonist of a polypeptide having an amino acid
sequence selected from the group consisting of SEQ ID NO:1-25 and fragments thereof.

BRIEF DESCRIPTION OF THE TABLES 

Table 1 shows polypeptide and nucleotide sequence identification numbers (SEQ ID 
NOs), clone identification numbers (clone IDs), cDNA libraries, and cDNA fragments used to 
assemble full-length sequences encoding RNAAP. 
Table 2 shows features of each polypeptide sequence, including potential motifs,
homologous sequences, and methods and algorithms used for identification of RNAAP.

Table 3 shows the tissue-specific expression patterns of each nucleic acid sequence as 
determined by northern analysis, diseases, disorders, or conditions associated with these tissues, 
and the vector into which each cDNA was cloned. 
Table 4 describes the tissues used to construct the cDNA libraries from which cDNA
clones encoding RNAAP were isolated.

Table 5 shows the tools, programs, and algorithms used to analyze RNAAP, along with 
applicable descriptions, references, and threshold parameters. 

DESCRIPTION OF THE INVENTION

Before the present proteins, nucleotide sequences, and methods are described, it is 
understood that this invention is not limited to the particular machines, materials and methods 
described, as these may vary. It is also to be understood that the terminology used herein is for the 
purpose of describing particular embodiments only, and is not intended to limit the scope of the 
present invention, which will be limited only by the appended claims.






WO 00/11171 PCT/US99/19361

It must be noted that as used herein and in the appended claims, the singular forms "a," 
"an," and "the" include plural reference unless the context clearly dictates otherwise. Thus, for 
example, a reference to "a host cell" includes a plurality of such host cells, and a reference to "an 
antibody" is a reference to one or more antibodies and equivalents thereof known to those skilled 
in the art, and so forth.

Unless defined otherwise, all technical and scientific terms used herein have the same 
meanings as commonly understood by one of ordinary skill in the art to which this invention 
belongs. Although any machines, materials, and methods similar or equivalent to those described
herein can be used to practice or test the present invention, the preferred machines, materials and
methods are now described. All publications mentioned herein are cited for the purpose of
describing and disclosing the cell lines, protocols, reagents and vectors which are reported in the 
publications and which might be used in connection with the invention. Nothing herein is to be 
construed as an admission that the invention is not entitled to antedate such disclosure by virtue of 
prior invention. 

DEFINITIONS

"RNAAP" refers to the amino acid sequences of substantially purified RNAAP obtained 
from any species, particularly a mammalian species, including bovine, ovine, porcine, murine, 
equine, and preferably the human species, from any source, whether natural, synthetic, 
semi-synthetic, or recombinant. 

The term "agonist" refers to a molecule which, when bound to RNAAP, increases or
prolongs the duration of the effect of RNAAP. Agonists may include proteins, nucleic acids,
carbohydrates, or any other molecules which bind to and modulate the effect of RNAAP.

An "allelic variant" is an alternative form of the gene encoding RNAAP. Allelic variants 
may result from at least one mutation in the nucleic acid sequence and may result in altered
mRNAs or in polypeptides whose structure or function may or may not be altered. Any given
natural or recombinant gene may have none, one, or many allelic forms. Common mutational
changes which give rise to allelic variants are generally ascribed to natural deletions, additions, or
substitutions of nucleotides. Each of these types of changes may occur alone, or in combination 
with the others, one or more times in a given sequence. 

"Altered" nucleic acid sequences encoding RNAAP include those sequences with
deletions, insertions, or substitutions of different nucleotides, resulting in a polynucleotide the
same as RNAAP or a polypeptide with at least one functional characteristic of RNAAP. Included
within this definition are polymorphisms which may or may not be readily detectable using a
particular oligonucleotide probe of the polynucleotide encoding RNAAP, and improper or
unexpected hybridization to allelic variants, with a locus other than the normal chromosomal locus
for the polynucleotide sequence encoding RNAAP. The encoded protein may also be "altered,"
and may contain deletions, insertions, or substitutions of amino acid residues which produce a
silent change and result in a functionally equivalent RNAAP. Deliberate amino acid substitutions
may be made on the basis of similarity in polarity, charge, solubility, hydrophobicity,
hydrophilicity, and/or the amphipathic nature of the residues, as long as the biological or
immunological activity of RNAAP is retained. For example, negatively charged amino acids may
include aspartic acid and glutamic acid, positively charged amino acids may include lysine and
arginine, and amino acids with uncharged polar head groups having similar hydrophilicity values
may include leucine, isoleucine, and valine; glycine and alanine; asparagine and glutamine; serine
and threonine; and phenylalanine and tyrosine.
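
As a non-limiting illustration, the substitution groups listed above can be written as a small lookup in Python. The sketch below assumes the standard one-letter amino acid codes and treats a substitution as "conservative" only when both residues fall within the same listed group. The names `SUBSTITUTION_GROUPS` and `is_conservative` are hypothetical, and the check says nothing about whether biological or immunological activity is actually retained, which must be determined experimentally.

```python
# Groups taken from the preceding paragraph, in one-letter code:
# Asp/Glu, Lys/Arg, Leu/Ile/Val, Gly/Ala, Asn/Gln, Ser/Thr, Phe/Tyr.
SUBSTITUTION_GROUPS = [
    set("DE"),
    set("KR"),
    set("LIV"),
    set("GA"),
    set("NQ"),
    set("ST"),
    set("FY"),
]


def is_conservative(original, replacement):
    """Return True if the two residues differ but share a listed group."""
    a, b = original.upper(), replacement.upper()
    if a == b:
        return False  # no substitution took place
    return any(a in group and b in group for group in SUBSTITUTION_GROUPS)


if __name__ == "__main__":
    print(is_conservative("D", "E"))  # same group: aspartic acid / glutamic acid
    print(is_conservative("D", "K"))  # different groups: opposite charge
```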

The terms "amino acid" and "amino acid sequence" refer to an oligopeptide, peptide,
polypeptide, or protein sequence, or a fragment of any of these, and to naturally occurring or
synthetic molecules. In this context, "fragments," "immunogenic fragments," or "antigenic
fragments" refer to fragments of RNAAP which are preferably at least 5 to about 15 amino acids
in length, most preferably at least 14 amino acids, and which retain some biological activity or
immunological activity of RNAAP. Where "amino acid sequence" is recited to refer to an amino
acid sequence of a naturally occurring protein molecule, "amino acid sequence" and like terms are
not meant to limit the amino acid sequence to the complete native amino acid sequence associated
with the recited protein molecule.

"Amplification" relates to the production of additional copies of a nucleic acid sequence. 
Amplification is generally carried out using polymerase chain reaction (PCR) technologies well
known in the art. 

The term "antagonist" refers to a molecule which, when bound to RNAAP, decreases the
amount or the duration of the effect of the biological or immunological activity of RNAAP.
Antagonists may include proteins, nucleic acids, carbohydrates, antibodies, or any other molecules
which decrease the effect of RNAAP.

The term "antibody" refers to intact molecules as well as to fragments thereof, such as
Fab, F(ab')2, and Fv fragments, which are capable of binding the epitopic determinant. Antibodies
that bind RNAAP polypeptides can be prepared using intact polypeptides or using fragments
containing small peptides of interest as the immunizing antigen. The polypeptide or oligopeptide
used to immunize an animal (e.g., a mouse, a rat, or a rabbit) can be derived from the translation of
RNA, or synthesized chemically, and can be conjugated to a carrier protein if desired. Commonly
used carriers that are chemically coupled to peptides include bovine serum albumin, thyroglobulin,
and keyhole limpet hemocyanin (KLH). The coupled peptide is then used to immunize the animal.

The term "antigenic determinant" refers to that fragment of a molecule (i.e., an epitope) 
that makes contact with a particular antibody. When a protein or a fragment of a protein is used to 
immunize a host animal, numerous regions of the protein may induce the production of antibodies 
which bind specifically to antigenic determinants (given regions or three-dimensional structures on
the protein). An antigenic determinant may compete with the intact antigen (i.e., the immunogen 
used to elicit the immune response) for binding to an antibody. 

The term "antisense" refers to any composition containing a nucleic acid sequence which
is complementary to the "sense" strand of a specific nucleic acid sequence. Antisense molecules
may be produced by any method including synthesis or transcription. Once introduced into a cell,
the complementary nucleotides combine with natural sequences produced by the cell to form
duplexes and to block either transcription or translation. The designation "negative" can refer to
the antisense strand, and the designation "positive" can refer to the sense strand.

The term "biologically active" refers to a protein having structural, regulatory, or
biochemical functions of a naturally occurring molecule. Likewise, "immunologically active"
refers to the capability of the natural, recombinant, or synthetic RNAAP, or of any oligopeptide
thereof, to induce a specific immune response in appropriate animals or cells and to bind with
specific antibodies.

The terms "complementary" and "complementarity" refer to the natural binding of
polynucleotides by base pairing. For example, the sequence "5' A-G-T 3'" bonds to the
complementary sequence "3' T-C-A 5'." Complementarity between two single-stranded molecules
may be "partial," such that only some of the nucleic acids bind, or it may be "complete," such that
total complementarity exists between the single stranded molecules. The degree of
complementarity between nucleic acid strands has significant effects on the efficiency and strength
of the hybridization between the nucleic acid strands. This is of particular importance in
amplification reactions, which depend upon binding between nucleic acid strands, and in the
design and use of peptide nucleic acid (PNA) molecules.
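
The base-pairing example above can be sketched in Python as follows. This is a minimal illustration only: it assumes DNA-style Watson-Crick pairing (A with T, G with C), ungapped strands of equal length, and a position-by-position comparison of one strand written 5' to 3' against another written 3' to 5'. The function names are hypothetical.

```python
# Watson-Crick partners for DNA bases (for RNA, A would pair with U).
PAIRS = {"A": "T", "T": "A", "G": "C", "C": "G"}


def complement_3_to_5(strand_5_to_3):
    """Return the complementary strand, written 3'->5', base by base."""
    return "".join(PAIRS[base] for base in strand_5_to_3.upper())


def fraction_complementary(strand_5_to_3, other_3_to_5):
    """Fraction of aligned positions able to base pair.

    A value of 1.0 corresponds to "complete" complementarity; any
    value between 0 and 1 corresponds to "partial" complementarity.
    """
    if not strand_5_to_3 or len(strand_5_to_3) != len(other_3_to_5):
        raise ValueError("strands must be non-empty and of equal length")
    paired = sum(
        PAIRS[a] == b
        for a, b in zip(strand_5_to_3.upper(), other_3_to_5.upper())
    )
    return paired / len(strand_5_to_3)


if __name__ == "__main__":
    print(complement_3_to_5("AGT"))              # complement of 5' A-G-T 3'
    print(fraction_complementary("AGT", "TCA"))  # complete complementarity
    print(fraction_complementary("AGT", "TCG"))  # partial complementarity
```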

A "composition comprising a given polynucleotide sequence" and a "composition 
comprising a given amino acid sequence" refer broadly to any composition containing the given 
polynucleotide or amino acid sequence. The composition may comprise a dry formulation or an
aqueous solution. Compositions comprising polynucleotide sequences encoding RNAAP or
fragments of RNAAP may be employed as hybridization probes. The probes may be stored in
freeze-dried form and may be associated with a stabilizing agent such as a carbohydrate. In
hybridizations, the probe may be deployed in an aqueous solution containing salts (e.g., NaCl),
detergents (e.g., sodium dodecyl sulfate; SDS), and other components (e.g., Denhardt's solution,
dry milk, salmon sperm DNA, etc.).

"Consensus sequence" refers to a nucleic acid sequence which has been resequenced to 
resolve uncalled bases, extended using the XL-PCR kit (Perkin-Elmer, Norwalk CT) in the 5'
and/or the 3' direction, and resequenced, or which has been assembled from the overlapping
sequences of more than one Incyte Clone using a computer program for fragment assembly, such
as the GELVIEW fragment assembly system (GCG, Madison WI). Some sequences have been 
both extended and assembled to produce the consensus sequence. 

The term "correlates with expression of a polynucleotide" indicates that the detection of
the presence of nucleic acids, the same or related to a nucleic acid sequence encoding RNAAP, by
northern analysis is indicative of the presence of nucleic acids encoding RNAAP in a sample, and
thereby correlates with expression of the transcript from the polynucleotide encoding RNAAP.

A "deletion" refers to a change in the amino acid or nucleotide sequence that results in the
absence of one or more amino acid residues or nucleotides. 

The term "derivative" refers to the chemical modification of a polypeptide sequence, or a
polynucleotide sequence. Chemical modifications of a polynucleotide sequence can include, for
example, replacement of hydrogen by an alkyl, acyl, or amino group. A derivative polynucleotide
encodes a polypeptide which retains at least one biological or immunological function of the
natural molecule. A derivative polypeptide is one modified by glycosylation, pegylation, or any
similar process that retains at least one biological or immunological function of the polypeptide
from which it was derived.

The term "similarity" refers to a degree of complementarity. There may be partial 
similarity or complete similarity. The word "identity" may substitute for the word "similarity." A
partially complementary sequence that at least partially inhibits an identical sequence from
hybridizing to a target nucleic acid is referred to as "substantially similar." The inhibition of
hybridization of the completely complementary sequence to the target sequence may be examined
using a hybridization assay (Southern or northern blot, solution hybridization, and the like) under
conditions of reduced stringency. A substantially similar sequence or hybridization probe will
compete for and inhibit the binding of a completely similar (identical) sequence to the target
sequence under conditions of reduced stringency. This is not to say that conditions of reduced
stringency are such that non-specific binding is permitted, as reduced stringency conditions 
require that the binding of two sequences to one another be a specific (i.e., a selective) interaction. 
The absence of non-specific binding may be tested by the use of a second target sequence which 
lacks even a partial degree of complementarity (e.g., less than about 30% similarity or identity). 



WO 00/11171 PCT/US99/19361

In the absence of non-specific binding, the substantially similar sequence or probe will not 
hybridize to the second non-complementary target sequence. 

The phrases "percent identity" and "% identity" refer to the percentage of sequence
similarity found in a comparison of two or more amino acid or nucleic acid sequences. Percent
identity can be determined electronically, e.g., by using the MEGALIGN program (DNASTAR,
Madison WI) which creates alignments between two or more sequences according to methods
selected by the user, e.g., the clustal method. (See, e.g., Higgins, D.G. and P.M. Sharp (1988)
Gene 73:237-244.) Parameters for each method may be the default parameters provided by
MEGALIGN or may be specified by the user. The clustal algorithm groups sequences into
clusters by examining the distances between all pairs. The clusters are aligned pairwise and then
in groups. The percentage similarity between two amino acid sequences, e.g., sequence A and
sequence B, is calculated by dividing the length of sequence A, minus the number of gap residues
in sequence A, minus the number of gap residues in sequence B, into the sum of the residue
matches between sequence A and sequence B, times one hundred. Gaps of low or of no similarity
between the two amino acid sequences are not included in determining percentage similarity.
Percent identity between nucleic acid sequences can also be counted or calculated by other 
methods known in the art, e.g., the Jotun Hein method. (See, e.g., Hein, J. (1990) Methods 
Enzymol. 183:626-645.) Identity between sequences can also be determined by other methods 
known in the art, e.g., by varying hybridization conditions. 
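
The percentage similarity calculation described above can be sketched in Python as follows. This is a minimal illustration only and is not the MEGALIGN implementation. It assumes that the two sequences are supplied as rows of an existing alignment of equal length, that gaps are written as "-", and that "the length of sequence A" means the length of the aligned row. The function name and the example sequences are hypothetical.

```python
def percent_similarity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percentage similarity of two aligned rows, per the formula in the text.

    Assumption: len(aligned_a) is the aligned length, so the denominator is the
    number of alignment columns left after subtracting the gaps in each row.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    gaps_a = aligned_a.count(gap)
    gaps_b = aligned_b.count(gap)
    denominator = len(aligned_a) - gaps_a - gaps_b
    if denominator <= 0:
        raise ValueError("no comparable residues in the alignment")
    # A residue match is a column where both rows carry the same non-gap residue.
    matches = sum(1 for a, b in zip(aligned_a, aligned_b) if a == b and a != gap)
    return 100.0 * matches / denominator


# Hypothetical aligned rows: 7 columns, 1 gap in each row, 4 matching residues.
print(percent_similarity("MKT-LLV", "MRTALL-"))
```

In this example the denominator is 7 - 1 - 1 = 5 and four residues match, so the function returns 80.0.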

"Human artificial chromosomes" (HACs) are linear microchromosomes which may
contain DNA sequences of about 6 kb to 10 Mb in size, and which contain all of the elements
required for stable mitotic chromosome segregation and maintenance. 

The term "humanized antibody" refers to antibody molecules in which the amino acid 
sequence in the non-antigen binding regions has been altered so that the antibody more closely
resembles a human antibody, and still retains its original binding ability.

"Hybridization" refers to any process by which a strand of nucleic acid binds with a 
complementary strand through base pairing. 
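
As an illustration of base pairing between complementary strands, the following Python sketch computes the reverse complement of a DNA strand and tests whether two strands are fully complementary. It assumes unambiguous DNA bases (A, C, G, T) and sequences written 5' to 3'. The function names are hypothetical.

```python
# Watson-Crick pairing for unambiguous DNA bases (upper or lower case).
COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(dna: str) -> str:
    """Return the strand that base pairs with `dna`, written 5' to 3'."""
    return dna.translate(COMPLEMENT)[::-1]


def is_fully_complementary(strand: str, other: str) -> bool:
    """True if every base of `strand` pairs with `other` in antiparallel orientation."""
    return strand.upper() == reverse_complement(other).upper()


print(reverse_complement("ATGCCG"))
print(is_fully_complementary("ATGCCG", "CGGCAT"))
```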

The term "hybridization complex" refers to a complex formed between two nucleic acid 
sequences by virtue of the formation of hydrogen bonds between complementary bases. A
hybridization complex may be formed in solution (e.g., C0t or R0t analysis) or formed between one
nucleic acid sequence present in solution and another nucleic acid sequence immobilized on a 
solid support (e.g., paper, membranes, filters, chips, pins or glass slides, or any other appropriate 
substrate to which cells or their nucleic acids have been fixed). 

The words "insertion" and "addition" refer to changes in an amino acid or nucleotide 
sequence resulting in the addition of one or more amino acid residues or nucleotides, respectively,
to the sequence found in the naturally occurring molecule. 

"Immune response" can refer to conditions associated with inflammation, trauma, immune 
disorders, or infectious or genetic disease, etc. These conditions can be characterized by 
expression of various factors, e.g., cytokines, chemokines, and other signaling molecules, which
may affect cellular and systemic defense systems. 

The term "microarray" refers to an arrangement of distinct polynucleotides on a substrate. 

The terms "element" and "array element" in a microarray context, refer to hybridizable 
polynucleotides arranged on the surface of a substrate. 

The term "modulate" refers to a change in the activity of RNAAP. For example,
modulation may cause an increase or a decrease in protein activity, binding characteristics, or any
other biological, functional, or immunological properties of RNAAP. 

The phrases "nucleic acid" or "nucleic acid sequence," as used herein, refer to a 
nucleotide, oligonucleotide, polynucleotide, or any fragment thereof. These phrases also refer to
DNA or RNA of genomic or synthetic origin which may be single-stranded or double-stranded
and may represent the sense or the antisense strand, to peptide nucleic acid (PNA), or to any
DNA-like or RNA-like material. In this context, "fragments" refers to those nucleic acid
sequences which comprise a region of unique polynucleotide sequence that specifically identifies
SEQ ID NO:26-50, for example, as distinct from any other sequence in the same genome. For
example, a fragment of SEQ ID NO:26-50 is useful in hybridization and amplification
technologies and in analogous methods that distinguish SEQ ID NO:26-50 from related
polynucleotide sequences. A fragment of SEQ ID NO:26-50 is at least about 15-20 nucleotides in
length. The precise length of the fragment of SEQ ID NO:26-50 and the region of SEQ ID
NO:26-50 to which the fragment corresponds are routinely determinable by one of ordinary skill
in the art based on the intended purpose for the fragment. In some cases, a fragment, when
translated, would produce polypeptides retaining some functional characteristic, e.g., antigenicity,
or structural domain characteristic, e.g., ATP-binding site, of the full-length polypeptide. 

The terms "operably associated" and "operably linked" refer to functionally related 
nucleic acid sequences. A promoter is operably associated or operably linked with a coding
sequence if the promoter controls the translation of the encoded polypeptide. While operably
associated or operably linked nucleic acid sequences can be contiguous and in the same reading 
frame, certain genetic elements, e.g., repressor genes, are not contiguously linked to the sequence 
encoding the polypeptide but still bind to operator sequences that control expression of the 
polypeptide. 

The term "oligonucleotide" refers to a nucleic acid sequence of at least about 6
nucleotides to 60 nucleotides, preferably about 15 to 30 nucleotides, and most preferably about 20
to 25 nucleotides, which can be used in PCR amplification or in a hybridization assay or
microarray. "Oligonucleotide" is substantially equivalent to the terms "amplimer," "primer,"
"oligomer," and "probe," as these terms are commonly defined in the art.

"Peptide nucleic acid" (PNA) refers to an antisense molecule or anti-gene agent which 
comprises an oligonucleotide of at least about 5 nucleotides in length linked to a peptide backbone 
of amino acid residues ending in lysine. The terminal lysine confers solubility to the composition. 
PNAs preferentially bind complementary single stranded DNA or RNA and stop transcript
elongation, and may be pegylated to extend their lifespan in the cell.

The term "sample" is used in its broadest sense. A sample suspected of containing nucleic 
acids encoding RNAAP, or fragments thereof, or RNAAP itself, may comprise a bodily fluid; an 
extract from a cell, chromosome, organelle, or membrane isolated from a cell; a cell; genomic 
DNA, RNA, or cDNA, in solution or bound to a substrate; a tissue; a tissue print; etc. 

The terms "specific binding" and "specifically binding" refer to that interaction between a
protein or peptide and an agonist, an antibody, or an antagonist. The interaction is dependent upon
the presence of a particular structure of the protein, e.g., the antigenic determinant or epitope,
recognized by the binding molecule. For example, if an antibody is specific for epitope "A," the
presence of a polypeptide containing the epitope A, or the presence of free unlabeled A, in a
reaction containing free labeled A and the antibody will reduce the amount of labeled A that binds
to the antibody. 

The term "stringent conditions" refers to conditions which permit hybridization between 
polynucleotides and the claimed polynucleotides. Stringent conditions can be defined by salt 
concentration, the concentration of organic solvent, e.g., formamide, temperature, and other 
conditions well known in the art. In particular, stringency can be increased by reducing the
concentration of salt, increasing the concentration of formamide, or raising the hybridization 
temperature. 

The term "substantially purified" refers to nucleic acid or amino acid sequences that are 
removed from their natural environment and are isolated or separated, and are at least about 60% 
free, preferably about 75% free, and most preferably about 90% free from other components with
which they are naturally associated. 

A "substitution" refers to the replacement of one or more amino acids or nucleotides by 
different amino acids or nucleotides, respectively. 

"Substrate" refers to any suitable rigid or semi-rigid support including membranes, filters, 
chips, slides, wafers, fibers, magnetic or nonmagnetic beads, gels, tubing, plates, polymers,
microparticles and capillaries. The substrate can have a variety of surface forms, such as wells, 
trenches, pins, channels and pores, to which polynucleotides or polypeptides are bound.

"Transformation" describes a process by which exogenous DNA enters and changes a
recipient cell. Transformation may occur under natural or artificial conditions according to
various methods well known in the art, and may rely on any known method for the insertion of
foreign nucleic acid sequences into a prokaryotic or eukaryotic host cell. The method for
transformation is selected based on the type of host cell being transformed and may include, but is
not limited to, viral infection, electroporation, heat shock, lipofection, and particle bombardment.
The term "transformed" cells includes stably transformed cells in which the inserted DNA is
capable of replication either as an autonomously replicating plasmid or as part of the host 
chromosome, as well as transiently transformed cells which express the inserted DNA or RNA for 
limited periods of time. 

A "variant" of RNAAP polypeptides refers to an amino acid sequence that is altered by
one or more amino acid residues. The variant may have "conservative" changes, wherein a
substituted amino acid has similar structural or chemical properties (e.g., replacement of leucine
with isoleucine). More rarely, a variant may have "nonconservative" changes (e.g., replacement
of glycine with tryptophan). Analogous minor variations may also include amino acid deletions or
insertions, or both. Guidance in determining which amino acid residues may be substituted,
inserted, or deleted without abolishing biological or immunological activity may be found using
computer programs well known in the art, for example, LASERGENE software (DNASTAR). 

The term "variant," when used in the context of a polynucleotide sequence, may 
encompass a polynucleotide sequence related to RNAAP. This definition may also include, for 
example, "allelic" (as defined above), "splice," "species," or "polymorphic" variants. A splice
variant may have significant identity to a reference molecule, but will generally have a greater or
lesser number of polynucleotides due to alternate splicing of exons during mRNA processing. The
corresponding polypeptide may possess additional functional domains or an absence of domains.
Species variants are polynucleotide sequences that vary from one species to another. The resulting
polypeptides generally will have significant amino acid identity relative to each other. A
polymorphic variant is a variation in the polynucleotide sequence of a particular gene between
individuals of a given species. Polymorphic variants also may encompass "single nucleotide 
polymorphisms" (SNPs) in which the polynucleotide sequence varies by one base. The presence 
of SNPs may be indicative of, for example, a certain population, a disease state, or a propensity for 
a disease state. 
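
A single-base variation of the kind described for SNPs can be located by comparing two sequences position by position. The Python sketch below is illustrative only. It assumes that the reference and sample sequences are already aligned and of equal length, and it reports positions using 1-based numbering. The function name and the example sequences are hypothetical.

```python
def single_base_differences(reference: str, sample: str) -> list[tuple[int, str, str]]:
    """List (position, reference base, sample base) for each mismatched position.

    Assumption: both sequences are pre-aligned and equal in length; positions
    are reported 1-based.
    """
    if len(reference) != len(sample):
        raise ValueError("sequences must be the same length")
    return [
        (index + 1, ref_base, sample_base)
        for index, (ref_base, sample_base) in enumerate(zip(reference, sample))
        if ref_base != sample_base
    ]


# Hypothetical sequences differing by one base at position 6 (C in the reference, A in the sample).
print(single_base_differences("ATGGCCTTA", "ATGGCATTA"))
```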

THE INVENTION

The invention is based on the discovery of new human RNA-associated proteins 
(RNAAP), the polynucleotides encoding RNAAP, and the use of these compositions for the 
diagnosis, treatment, or prevention of cell proliferative, autoimmune/inflammatory, and infectious 
disorders.

Table 1 lists the Incyte clones used to assemble full length nucleotide sequences encoding 
RNAAP. Columns 1 and 2 show the sequence identification numbers (SEQ ID NOs) of the 
polypeptide and nucleotide sequences, respectively. Column 3 shows the clone IDs of the Incyte 
clones in which nucleic acids encoding each RNAAP were identified, and column 4 shows the
cDNA libraries from which these clones were isolated. Column 5 shows Incyte clones and their
corresponding cDNA libraries. Clones for which cDNA libraries are not indicated were derived
from pooled cDNA libraries. The clones in column 5 were used to assemble the consensus
nucleotide sequence of each RNAAP and are useful as fragments in hybridization technologies.

The columns of Table 2 show various properties of each of the polypeptides of the
invention: column 1 references the SEQ ID NO; column 2 shows the number of amino acid
residues in each polypeptide; column 3 shows potential phosphorylation sites; column 4 shows
potential glycosylation sites; column 5 shows the amino acid residues comprising signature 
sequences and motifs; column 6 shows the identity of each protein; and column 7 shows analytical 
methods used to identify each protein through sequence homology and protein motifs. 

The columns of Table 3 show the tissue-specificity and diseases, disorders, or conditions
associated with nucleotide sequences encoding RNAAP. The first column of Table 3 lists the
nucleotide SEQ ID NOs. Column 2 lists tissue categories which express RNAAP as a fraction of
total tissue categories expressing RNAAP. Column 3 lists diseases, disorders, or conditions
associated with those tissues expressing RNAAP. Column 4 lists the vectors used to subclone the
cDNA library.

The columns of Table 4 show descriptions of the tissues used to construct the cDNA 
libraries from which cDNA clones encoding RNAAP were isolated. Column 1 references the 
nucleotide SEQ ID NOs, column 2 shows the cDNA libraries from which these clones were 
isolated, and column 3 shows the tissue origins and other descriptive information relevant to the 
cDNA libraries in column 2.

The following fragments of the nucleotide sequences encoding RNAAP are useful, for 
example, in hybridization or amplification technologies to identify SEQ ID NO:26-50 and to 
distinguish between SEQ ID NO:26-50 and related polynucleotide sequences. The useful 
fragments include the fragment of SEQ ID NO:26 from about nucleotide 586 to about nucleotide 
615; the fragment of SEQ ID NO:27 from about nucleotide 399 to about nucleotide 428; the
fragment of SEQ ID NO:28 from about nucleotide 234 to about nucleotide 263; the fragment of
SEQ ID NO:29 from about nucleotide 40 to about nucleotide 69; the fragment of SEQ ID
NO:30 from about nucleotide 20 to about nucleotide 49; the fragment of SEQ ID NO:31 from
about nucleotide 40 to about nucleotide 80; the fragment of SEQ ID NO:32 from about nucleotide
672 to about nucleotide 713; the fragment of SEQ ID NO:33 from about nucleotide 226 to about
nucleotide 276; the fragment of SEQ ID NO:34 from about nucleotide 719 to about nucleotide
761; the fragments of SEQ ID NO:35 from about nucleotide 77 to about nucleotide 167 and from
about nucleotide 168 to about nucleotide 259; the fragment of SEQ ID NO:36 from about
nucleotide 465 to about nucleotide 506; the fragment of SEQ ID NO:37 from about nucleotide 76
to about nucleotide 117; the fragment of SEQ ID NO:38 from about nucleotide 136 to about
nucleotide 180; the fragments of SEQ ID NO:39 from about nucleotide 215 to about nucleotide
262, from about nucleotide 683 to about nucleotide 727, and from about nucleotide 1805 to about
nucleotide 1852; the fragment of SEQ ID NO:40 from about nucleotide 162 to about nucleotide
206; the fragment of SEQ ID NO:41 from about nucleotide 379 to about nucleotide 423; the
fragment of SEQ ID NO:42 from about nucleotide 164 to about nucleotide 208; the fragment of
SEQ ID NO:43 from about nucleotide 1 to about nucleotide 42; the fragments of SEQ ID NO:44
from about nucleotide 249 to about 296 and from about nucleotide 816 to about nucleotide 862;
the fragment of SEQ ID NO:45 from about nucleotide 196 to about nucleotide 240; the fragment
of SEQ ID NO:46 from about nucleotide 1 to about nucleotide 54; the fragment of SEQ ID NO:47
from about nucleotide 463 to about nucleotide 507; the fragments of SEQ ID NO:48 from about
nucleotide 551 to about nucleotide 595, from about nucleotide 866 to about nucleotide 910, and
from about nucleotide 1406 to about nucleotide 1450; the fragments of SEQ ID NO:49 from about
nucleotide 218 to about nucleotide 263, from about nucleotide 758 to about nucleotide 802, and
from about nucleotide 1190 to about nucleotide 1234; and the fragment of SEQ ID NO:50 from
about nucleotide 11 to about nucleotide 70.
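
The fragment coordinates listed above can be applied to a sequence as sketched below in Python. The sketch assumes that the coordinates are 1-based and inclusive of both end points, so that nucleotides 586 to 615 span 30 nucleotides. Only three entries from the list are included, and the sequence used in the example is a synthetic placeholder, not any of SEQ ID NO:26-50. The dictionary and function names are hypothetical.

```python
# A few (start, end) coordinates taken from the list above, keyed by SEQ ID NO.
# Assumption: coordinates are 1-based and inclusive of both end points.
FRAGMENT_COORDINATES = {
    26: [(586, 615)],
    27: [(399, 428)],
    35: [(77, 167), (168, 259)],
}


def extract_fragments(seq_id_no: int, sequence: str) -> list[str]:
    """Return the listed fragments of `sequence` for the given SEQ ID NO."""
    fragments = []
    for start, end in FRAGMENT_COORDINATES[seq_id_no]:
        if start < 1 or end > len(sequence):
            raise ValueError(f"fragment {start}-{end} lies outside the sequence")
        # Convert 1-based inclusive coordinates to a Python half-open slice.
        fragments.append(sequence[start - 1:end])
    return fragments


# Synthetic placeholder sequence of 700 nucleotides (not a real SEQ ID NO).
placeholder = "".join("ACGT"[i % 4] for i in range(700))
for fragment in extract_fragments(26, placeholder):
    print(len(fragment), fragment)
```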

The invention also encompasses RNAAP variants. A preferred RNAAP variant is one 
which has at least about 80%, more preferably at least about 90%, and most preferably at least 
about 95% amino acid sequence identity to the RNAAP amino acid sequence, and which contains
at least one functional or structural characteristic of RNAAP.

The invention also encompasses polynucleotides which encode RNAAP. In a particular 
embodiment, the invention encompasses a polynucleotide sequence comprising a sequence 
selected from the group consisting of SEQ ID NO:26-50, which encodes RNAAP. 

The invention also encompasses a variant of a polynucleotide sequence encoding RNAAP. 
In particular, such a variant polynucleotide sequence will have at least about 70%, more preferably
at least about 85%, and most preferably at least about 95% polynucleotide sequence identity to the
polynucleotide sequence encoding RNAAP. A particular aspect of the invention encompasses a
variant of a polynucleotide sequence comprising a sequence selected from the group consisting of
SEQ ID NO:26-50 which has at least about 70%, more preferably at least about 85%, and most
preferably at least about 95% polynucleotide sequence identity to a nucleic acid sequence selected 
from the group consisting of SEQ ID NO:26-50. Any one of the polynucleotide variants described 
above can encode an amino acid sequence which contains at least one functional or structural 
characteristic of RNAAP. 

It will be appreciated by those skilled in the art that as a result of the degeneracy of the
genetic code, a multitude of polynucleotide sequences encoding RNAAP, some bearing minimal
similarity to the polynucleotide sequences of any known and naturally occurring gene, may be
produced. Thus, the invention contemplates each and every possible variation of polynucleotide
sequence that could be made by selecting combinations based on possible codon choices. These
combinations are made in accordance with the standard triplet genetic code as applied to the
polynucleotide sequence of naturally occurring RNAAP, and all such variations are to be 
considered as being specifically disclosed. 
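
The size of the "multitude" of encoding sequences follows directly from the degeneracy of the standard genetic code: the number of distinct coding sequences for a peptide is the product of the number of synonymous codons for each residue. The Python sketch below illustrates the arithmetic. The codon table is the standard genetic code written in T, C, A, G order; the function name and the example peptide are hypothetical, and stop codons are not counted.

```python
from collections import Counter
from itertools import product
from math import prod

# Standard genetic code, codons enumerated in T, C, A, G order ("*" marks a stop).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Number of synonymous codons available for each amino acid.
DEGENERACY = Counter(CODON_TABLE.values())


def count_encoding_sequences(peptide: str) -> int:
    """Number of distinct coding sequences for `peptide` (stop codon excluded)."""
    for residue in peptide:
        if residue == "*" or residue not in DEGENERACY:
            raise ValueError(f"not a standard amino acid: {residue!r}")
    return prod(DEGENERACY[residue] for residue in peptide)


# Hypothetical six-residue peptide: M(1) x K(2) x T(4) x L(6) x L(6) x V(4).
print(count_encoding_sequences("MKTLLV"))
```

For the six-residue example the count is 1 x 2 x 4 x 6 x 6 x 4 = 1152, which indicates how quickly the number of possible encoding sequences grows with polypeptide length.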

Although nucleotide sequences which encode RNAAP and its variants are preferably 
capable of hybridizing to the nucleotide sequence of the naturally occurring RNAAP under
appropriately selected conditions of stringency, it may be advantageous to produce nucleotide
sequences encoding RNAAP or its derivatives possessing a substantially different codon usage,
e.g., inclusion of non-naturally occurring codons. Codons may be selected to increase the rate at
which expression of the peptide occurs in a particular prokaryotic or eukaryotic host in accordance
with the frequency with which particular codons are utilized by the host. Other reasons for
substantially altering the nucleotide sequence encoding RNAAP and its derivatives without
altering the encoded amino acid sequences include the production of RNA transcripts having more
desirable properties, such as a greater half-life, than transcripts produced from the naturally 
occurring sequence. 
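
Altering codon usage without altering the encoded amino acid sequence can be sketched in Python as follows. The sketch replaces each codon with a host-preferred synonymous codon where one is specified and then confirms that the translation is unchanged. The preferred-codon table shown is a hypothetical placeholder and does not represent measured codon frequencies for any particular host; the function names and the example sequence are likewise hypothetical.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order ("*" marks a stop).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Hypothetical host preferences (illustrative only, not measured usage data).
PREFERRED_CODON = {"L": "CTG", "K": "AAA", "T": "ACC", "V": "GTG"}


def translate(coding_sequence: str) -> str:
    """Translate a coding sequence whose length is a multiple of three."""
    if len(coding_sequence) % 3 != 0:
        raise ValueError("coding sequence length must be a multiple of three")
    return "".join(
        CODON_TABLE[coding_sequence[i:i + 3]]
        for i in range(0, len(coding_sequence), 3)
    )


def recode(coding_sequence: str) -> str:
    """Swap each codon for the preferred synonymous codon, if one is specified."""
    recoded = []
    for i in range(0, len(coding_sequence), 3):
        codon = coding_sequence[i:i + 3]
        amino_acid = CODON_TABLE[codon]
        recoded.append(PREFERRED_CODON.get(amino_acid, codon))
    result = "".join(recoded)
    # The encoded amino acid sequence must be unchanged by the substitution.
    assert translate(result) == translate(coding_sequence)
    return result


original = "ATGAAGACTTTATTGGTT"  # hypothetical sequence encoding M-K-T-L-L-V
print(recode(original))
```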

The invention also encompasses production of DNA sequences which encode RNAAP 
and RNAAP derivatives, or fragments thereof, entirely by synthetic chemistry. After production,
the synthetic sequence may be inserted into any of the many available expression vectors and cell 
systems using reagents well known in the art. Moreover, synthetic chemistry may be used to 
introduce mutations into a sequence encoding RNAAP or any fragment thereof. 

Also encompassed by the invention are polynucleotide sequences that are capable of 
hybridizing to the claimed polynucleotide sequences, and, in particular, to those shown in SEQ ID
NO:26-50 and fragments thereof under various conditions of stringency. (See, e.g., Wahl, G.M.
and S.L. Berger (1987) Methods Enzymol. 152:399-407; Kimmel, A.R. (1987) Methods Enzymol.
152:507-511.) For example, stringent salt concentration will ordinarily be less than about 750 mM
NaCl and 75 mM trisodium citrate, preferably less than about 500 mM NaCl and 50 mM trisodium
citrate, and most preferably less than about 250 mM NaCl and 25 mM trisodium citrate. Low
stringency hybridization can be obtained in the absence of organic solvent, e.g., formamide, while
high stringency hybridization can be obtained in the presence of at least about 35% formamide,
and most preferably at least about 50% formamide. Stringent temperature conditions will
ordinarily include temperatures of at least about 30°C, more preferably of at least about 37°C, and
most preferably of at least about 42°C. Varying additional parameters, such as hybridization time,
the concentration of detergent, e.g., sodium dodecyl sulfate (SDS), and the inclusion or exclusion
of carrier DNA, are well known to those skilled in the art. Various levels of stringency are
accomplished by combining these various conditions as needed. In a preferred embodiment,
hybridization will occur at 30°C in 750 mM NaCl, 75 mM trisodium citrate, and 1% SDS. In a
more preferred embodiment, hybridization will occur at 37°C in 500 mM NaCl, 50 mM trisodium
citrate, 1% SDS, 35% formamide, and 100 µg/ml denatured salmon sperm DNA (ssDNA). In a
most preferred embodiment, hybridization will occur at 42°C in 250 mM NaCl, 25 mM trisodium
citrate, 1% SDS, 50% formamide, and 200 µg/ml ssDNA. Useful variations on these conditions
will be readily apparent to those skilled in the art.

The washing steps which follow hybridization can also vary in stringency. Wash 
stringency conditions can be defined by salt concentration and by temperature. As above, wash 
stringency can be increased by decreasing salt concentration or by increasing temperature. For 
example, stringent salt concentration for the wash steps will preferably be less than about 30 mM
NaCl and 3 mM trisodium citrate, and most preferably less than about 15 mM NaCl and 1.5 mM
trisodium citrate. Stringent temperature conditions for the wash steps will ordinarily include
temperatures of at least about 25°C, more preferably of at least about 42°C, and most preferably of
at least about 68°C. In a preferred embodiment, wash steps will occur at 25°C in 30 mM NaCl, 3
mM trisodium citrate, and 0.1% SDS. In a more preferred embodiment, wash steps will occur at
42°C in 15 mM NaCl, 1.5 mM trisodium citrate, and 0.1% SDS. In a most preferred embodiment,
wash steps will occur at 68°C in 15 mM NaCl, 1.5 mM trisodium citrate, and 0.1% SDS. 
Additional variations on these conditions will be readily apparent to those skilled in the art. 
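
The salt concentrations recited for the hybridization and wash steps correspond to dilutions of standard saline-sodium citrate (SSC) buffer, and the conversion can be sketched in Python as follows. The sketch assumes the conventional definition of 1x SSC as 150 mM NaCl and 15 mM trisodium citrate, which is not recited in the text above, and it counts three sodium ions per citrate when totaling sodium. The function names are hypothetical.

```python
# Conventional 1x SSC composition (an assumption; not recited in the text).
SSC_1X_NACL_MM = 150.0
SSC_1X_CITRATE_MM = 15.0


def ssc_strength(nacl_mm: float, citrate_mm: float) -> float:
    """Express a NaCl/trisodium citrate mixture as a multiple of 1x SSC."""
    by_nacl = nacl_mm / SSC_1X_NACL_MM
    by_citrate = citrate_mm / SSC_1X_CITRATE_MM
    if abs(by_nacl - by_citrate) > 1e-9:
        raise ValueError("NaCl and citrate are not in the 10:1 SSC ratio")
    return by_nacl


def total_sodium_mm(nacl_mm: float, citrate_mm: float) -> float:
    """Total sodium ion concentration: one per NaCl, three per trisodium citrate."""
    return nacl_mm + 3.0 * citrate_mm


# (step, mM NaCl, mM trisodium citrate, temperature in degrees C) from the text.
CONDITIONS = [
    ("hybridization, preferred", 750.0, 75.0, 30),
    ("hybridization, more preferred", 500.0, 50.0, 37),
    ("hybridization, most preferred", 250.0, 25.0, 42),
    ("wash, preferred", 30.0, 3.0, 25),
    ("wash, more preferred", 15.0, 1.5, 42),
    ("wash, most preferred", 15.0, 1.5, 68),
]

for step, nacl, citrate, temperature in CONDITIONS:
    print(
        f"{step}: {ssc_strength(nacl, citrate):.2f}x SSC, "
        f"{total_sodium_mm(nacl, citrate):.1f} mM Na+, {temperature} C"
    )
```

Under this assumption the preferred hybridization embodiment corresponds to 5x SSC, and the wash embodiments correspond to 0.2x and 0.1x SSC.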

Methods for DNA sequencing are well known in the art and may be used to practice any 
of the embodiments of the invention. The methods may employ such enzymes as the Klenow 
fragment of DNA polymerase I, SEQUENASE (US Biochemical, Cleveland OH), Taq polymerase
(Perkin-Elmer), thermostable T7 polymerase (Amersham Pharmacia Biotech, Piscataway NJ), or
combinations of polymerases and proofreading exonucleases such as those found in the
ELONGASE amplification system (Life Technologies, Gaithersburg MD). Preferably, sequence
preparation is automated with machines such as the Hamilton MICROLAB 2200 (Hamilton, Reno
NV), Peltier Thermal Cycler 200 (PTC200; MJ Research, Watertown MA) and the ABI
CATALYST 800 (Perkin-Elmer). Sequencing is then carried out using either ABI 373 or 377
DNA sequencing systems (Perkin-Elmer), the MEGABACE 1000 DNA sequencing system
(Molecular Dynamics, Sunnyvale CA), or other systems known in the art. The resulting
sequences are analyzed using a variety of algorithms which are well known in the art. (See, e.g.,
Ausubel, F.M. (1997) Short Protocols in Molecular Biology, John Wiley & Sons, New York NY,
unit 7.7; Meyers, R.A. (1995) Molecular Biology and Biotechnology, Wiley VCH, New York NY,
pp. 856-853.)

The nucleic acid sequences encoding RNAAP may be extended utilizing a partial
nucleotide sequence and employing various PCR-based methods known in the art to detect
upstream sequences, such as promoters and regulatory elements. For example, one method which
may be employed, restriction-site PCR, uses universal and nested primers to amplify unknown
sequence from genomic DNA within a cloning vector. (See, e.g., Sarkar, G. (1993) PCR Methods
Applic. 2:318-322.) Another method, inverse PCR, uses primers that extend in divergent
directions to amplify unknown sequence from a circularized template. The template is derived
from restriction fragments comprising a known genomic locus and surrounding sequences. (See,
e.g., Triglia, T. et al. (1988) Nucleic Acids Res. 16:8186.) A third method, capture PCR, involves
PCR amplification of DNA fragments adjacent to known sequences in human and yeast artificial
chromosome DNA. (See, e.g., Lagerstrom, M. et al. (1991) PCR Methods Applic. 1:111-119.) In
this method, multiple restriction enzyme digestions and ligations may be used to insert an
engineered double-stranded sequence into a region of unknown sequence before performing PCR.
Other methods which may be used to retrieve unknown sequences are known in the art. (See, e.g.,
Parker, J.D. et al. (1991) Nucleic Acids Res. 19:3055-3060.) Additionally, one may use PCR,
nested primers, and PROMOTERFINDER libraries (Clontech, Palo Alto CA) to walk genomic
DNA. This procedure avoids the need to screen libraries and is useful in finding intron/exon
junctions. For all PCR-based methods, primers may be designed using commercially available
software, such as OLIGO 4.06 Primer Analysis software (National Biosciences, Plymouth MN) or
another appropriate program, to be about 22 to 30 nucleotides in length, to have a GC content of
about 50% or more, and to anneal to the template at temperatures of about 68°C to 72°C.
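
The length and GC content criteria stated above for primers can be checked with a short Python sketch such as the following. It is not the OLIGO 4.06 software and does not estimate annealing temperature, which would require a melting temperature model that the text does not specify. The function names and the example primers are hypothetical.

```python
def gc_fraction(primer: str) -> float:
    """Fraction of G and C bases in the primer."""
    if not primer:
        raise ValueError("primer is empty")
    sequence = primer.upper()
    return (sequence.count("G") + sequence.count("C")) / len(sequence)


def meets_primer_guidelines(
    primer: str,
    min_length: int = 22,
    max_length: int = 30,
    min_gc: float = 0.50,
) -> bool:
    """Check the length (about 22 to 30 nt) and GC content (about 50% or more) criteria.

    Annealing temperature (about 68 to 72 degrees C in the text) is not evaluated.
    """
    return min_length <= len(primer) <= max_length and gc_fraction(primer) >= min_gc


# Hypothetical primers: a 24-mer with 16 G/C bases, and a short AT-only oligomer.
print(meets_primer_guidelines("GCGATCGGCTAGCCATGGCTAGCC"))
print(meets_primer_guidelines("ATATATATAT"))
```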

When screening for full-length cDNAs, it is preferable to use libraries that have been
size-selected to include larger cDNAs. In addition, random-primed libraries, which often include
sequences containing the 5' regions of genes, are preferable for situations in which an oligo d(T)
library does not yield a full-length cDNA. Genomic libraries may be useful for extension of
sequence into 5' non-transcribed regulatory regions.

Capillary electrophoresis systems which are commercially available may be used to 
analyze the size or confirm the nucleotide sequence of sequencing or PCR products. In particular, 
capillary sequencing may employ flowable polymers for electrophoretic separation, four different 
nucleotide-specific, laser-stimulated fluorescent dyes, and a charge coupled device camera for
detection of the emitted wavelengths. Output/light intensity may be converted to electrical signal
using appropriate software (e.g., GENOTYPER and SEQUENCE NAVIGATOR, Perkin-Elmer),
and the entire process from loading of samples to computer analysis and electronic data display 
may be computer controlled. Capillary electrophoresis is especially preferable for sequencing 
small DNA fragments which may be present in limited amounts in a particular sample. 

In another embodiment of the invention, polynucleotide sequences or fragments thereof
which encode RNAAP may be cloned in recombinant DNA molecules that direct expression of
RNAAP, or fragments or functional equivalents thereof, in appropriate host cells. Due to the
inherent degeneracy of the genetic code, other DNA sequences which encode substantially the
same or a functionally equivalent amino acid sequence may be produced and used to express
RNAAP.

The nucleotide sequences of the present invention can be engineered using methods 
generally known in the art in order to alter RNAAP-encoding sequences for a variety of purposes 
including, but not limited to, modification of the cloning, processing, and/or expression of the 
gene product. DNA shuffling by random fragmentation and PCR reassembly of gene fragments
and synthetic oligonucleotides may be used to engineer the nucleotide sequences. For example,
oligonucleotide-mediated site-directed mutagenesis may be used to introduce mutations that create
new restriction sites, alter glycosylation patterns, change codon preference, produce splice
variants, and so forth. 

In another embodiment, sequences encoding RNAAP may be synthesized, in whole or in 
part, using chemical methods well known in the art. (See, e.g., Caruthers, M.H. et al. (1980) Nucl.
Acids Res. Symp. Ser. 215-223, and Horn, T. et al. (1980) Nucl. Acids Res. Symp. Ser. 225-232.)
Alternatively, RNAAP itself or a fragment thereof may be synthesized using chemical methods.
For example, peptide synthesis can be performed using various solid-phase techniques. (See, e.g.,
Roberge, J.Y. et al. (1995) Science 269:202-204.) Automated synthesis may be achieved using
the ABI 431A Peptide Synthesizer (Perkin-Elmer). Additionally, the amino acid sequence of
RNAAP, or any part thereof, may be altered during direct synthesis and/or combined with 
sequences from other proteins, or any part thereof, to produce a variant polypeptide. 

The peptide may be substantially purified by preparative high performance liquid 
chromatography. (See, e.g., Chiez, R.M. and F.Z. Regnier (1990) Methods Enzymol. 182:392-421.)
The composition of the synthetic peptides may be confirmed by amino acid analysis or by
sequencing. (See, e.g., Creighton, T. (1984) Proteins, Structures and Molecular Properties, WH
Freeman, New York NY.) 

In order to express a biologically active RNAAP, the nucleotide sequences encoding 
RNAAP or derivatives thereof may be inserted into an appropriate expression vector, i.e., a vector 
which contains the necessary elements for transcriptional and translational control of the inserted 
coding sequence in a suitable host. These elements include regulatory sequences, such as 
enhancers, constitutive and inducible promoters, and 5' and 3' untranslated regions in the vector 
and in polynucleotide sequences encoding RNAAP. Such elements may vary in their strength and 
specificity. Specific initiation signals may also be used to achieve more efficient translation of 
sequences encoding RNAAP. Such signals include the ATG initiation codon and adjacent 
sequences, e.g., the Kozak sequence. In cases where sequences encoding RNAAP and its initiation 
codon and upstream regulatory sequences are inserted into the appropriate expression vector, no 
additional transcriptional or translational control signals may be needed. However, in cases where 
only coding sequence, or a fragment thereof, is inserted, exogenous translational control signals 
including an in-frame ATG initiation codon should be provided by the vector. Exogenous 
translational elements and initiation codons may be of various origins, both natural and synthetic. 
The efficiency of expression may be enhanced by the inclusion of enhancers appropriate for the 
particular host cell system used. (See, e.g., Scharf, D. et al. (1994) Results Probl. Cell Differ. 
20:125-162.) 
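
As a rough illustration of the in-frame ATG requirement described above, the following Python sketch checks whether an insert carries its own initiation codon, whether a vector-supplied ATG is in frame with the insert, and whether an ATG sits in a favorable sequence context. The function names are hypothetical, and the simplified context rule (a purine at position -3 and a G at position +4, one common reading of the Kozak consensus) is an assumption made for illustration; neither is taken from this disclosure.

```python
PURINES = {"A", "G"}


def has_own_start_codon(insert_dna):
    """Return True if the insert begins with an ATG initiation codon."""
    return insert_dna.upper().startswith("ATG")


def vector_atg_in_frame(vector_atg_index, insert_start_index):
    """Return True if the first insert base starts a codon in the vector ATG frame.

    Both arguments are 0-based positions in the assembled construct.
    """
    offset = insert_start_index - vector_atg_index
    return offset >= 3 and offset % 3 == 0


def favorable_kozak_context(construct_dna, atg_index):
    """Simplified check (an assumption): purine at -3 and G at +4,
    counting the A of the ATG as position +1."""
    seq = construct_dna.upper()
    if seq[atg_index:atg_index + 3] != "ATG":
        raise ValueError("no ATG at the given index")
    if atg_index < 3 or atg_index + 3 >= len(seq):
        return False  # not enough flanking sequence to judge
    return seq[atg_index - 3] in PURINES and seq[atg_index + 3] == "G"
```

A fragment for which has_own_start_codon returns False would, under this sketch, rely on the vector to supply the ATG, and vector_atg_in_frame indicates whether the junction preserves the reading frame.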

Methods which are well known to those skilled in the art may be used to construct 
expression vectors containing sequences encoding RNAAP and appropriate transcriptional and 
translational control elements. These methods include in vitro recombinant DNA techniques, 
synthetic techniques, and in vivo genetic recombination. (See, e.g., Sambrook, J. et al. (1989) 
Molecular Cloning, A Laboratory Manual, Cold Spring Harbor Press, Plainview NY, ch. 4, 8, and 
16-17; Ausubel, F.M. et al. (1995) Current Protocols in Molecular Biology, John Wiley & Sons, 
New York NY, ch. 9, 13, and 16.) 

A variety of expression vector/host systems may be utilized to contain and express 
sequences encoding RNAAP. These include, but are not limited to, microorganisms such as 
bacteria transformed with recombinant bacteriophage, plasmid, or cosmid DNA expression 
vectors; yeast transformed with yeast expression vectors; insect cell systems infected with viral 
expression vectors (e.g., baculovirus); plant cell systems transformed with viral expression vectors 
(e.g., cauliflower mosaic virus, CaMV, or tobacco mosaic virus, TMV) or with bacterial 
expression vectors (e.g., Ti or pBR322 plasmids); or animal cell systems. The invention is not 
limited by the host cell employed. 

In bacterial systems, a number of cloning and expression vectors may be selected 
depending upon the use intended for polynucleotide sequences encoding RNAAP. For example, 
routine cloning, subcloning, and propagation of polynucleotide sequences encoding RNAAP can 
be achieved using a multifunctional E. coli vector such as PBLUESCRIPT (Stratagene, La Jolla 
CA) or pSPORT1 plasmid (Life Technologies). Ligation of sequences encoding RNAAP into the 
vector's multiple cloning site disrupts the lacZ gene, allowing a colorimetric screening procedure 
for identification of transformed bacteria containing recombinant molecules. In addition, these 
vectors may be useful for in vitro transcription, dideoxy sequencing, single strand rescue with 
helper phage, and creation of nested deletions in the cloned sequence. (See, e.g., Van Heeke, G. 
and S.M. Schuster (1989) J. Biol. Chem. 264:5503-5509.) When large quantities of RNAAP are 
needed, e.g. for the production of antibodies, vectors which direct high level expression of 
RNAAP may be used. For example, vectors containing the strong, inducible T5 or T7 
bacteriophage promoter may be used. 

Yeast expression systems may be used for production of RNAAP. A number of vectors 
containing constitutive or inducible promoters, such as alpha factor, alcohol oxidase, and PGH 
promoters, may be used in the yeast Saccharomyces cerevisiae or Pichia pastoris. In addition, 
such vectors direct either the secretion or intracellular retention of expressed proteins and enable 
integration of foreign sequences into the host genome for stable propagation. (See, e.g., Ausubel, 
1995, supra; Grant et al. (1987) Methods Enzymol. 153:516-54; and Scorer, C.A. et al. (1994) 
Bio/Technology 12:181-184.) 

Plant systems may also be used for expression of RNAAP. Transcription of sequences 
encoding RNAAP may be driven by viral promoters, e.g., the 35S and 19S promoters of CaMV used 
alone or in combination with the omega leader sequence from TMV (Takamatsu, N. (1987) 
EMBO J. 6:307-311). Alternatively, plant promoters such as the small subunit of RUBISCO or 
heat shock promoters may be used. (See, e.g., Coruzzi, G. et al. (1984) EMBO J. 3:1671-1680; 
Broglie, R. et al. (1984) Science 224:838-843; and Winter, J. et al. (1991) Results Probl. Cell 
Differ. 17:85-105.) These constructs can be introduced into plant cells by direct DNA 
transformation or pathogen-mediated transfection. (See, e.g., The McGraw Hill Yearbook of 
Science and Technology (1992) McGraw Hill, New York NY, pp. 191-196.) 

In mammalian cells, a number of viral-based expression systems may be utilized. In cases 
where an adenovirus is used as an expression vector, sequences encoding RNAAP may be ligated 
into an adenovirus transcription/translation complex consisting of the late promoter and tripartite 
leader sequence. Insertion in a non-essential E1 or E3 region of the viral genome may be used to 
obtain infective virus which expresses RNAAP in host cells. (See, e.g., Logan, J. and T. Shenk 
(1984) Proc. Natl. Acad. Sci. 81:3655-3659.) In addition, transcription enhancers, such as the 
Rous sarcoma virus (RSV) enhancer, may be used to increase expression in mammalian host cells. 
SV40 or EBV-based vectors may also be used for high-level protein expression. 

Human artificial chromosomes (HACs) may also be employed to deliver larger fragments 
of DNA than can be contained in and expressed from a plasmid. HACs of about 6 kb to 10 Mb 
are constructed and delivered via conventional delivery methods (liposomes, polycationic amino 
polymers, or vesicles) for therapeutic purposes. (See, e.g., Harrington, J.J. et al. (1997) Nat. Genet. 
15:345-355.) 

For long term production of recombinant proteins in mammalian systems, stable 
expression of RNAAP in cell lines is preferred. For example, sequences encoding RNAAP can be 
transformed into cell lines using expression vectors which may contain viral origins of replication 
and/or endogenous expression elements and a selectable marker gene on the same or on a separate 
vector. Following the introduction of the vector, cells may be allowed to grow for about 1 to 2 
days in enriched media before being switched to selective media. The purpose of the selectable 
marker is to confer resistance to a selective agent, and its presence allows growth and recovery of 
cells which successfully express the introduced sequences. Resistant clones of stably transformed 
cells may be propagated using tissue culture techniques appropriate to the cell type. 

Any number of selection systems may be used to recover transformed cell lines. These 
include, but are not limited to, the herpes simplex virus thymidine kinase and adenine 
phosphoribosyltransferase genes, for use in tk⁻ or apr⁻ cells, respectively. (See, e.g., Wigler, M. et 
al. (1977) Cell 11:223-232; Lowy, I. et al. (1980) Cell 22:817-823.) Also, antimetabolite, 
antibiotic, or herbicide resistance can be used as the basis for selection. For example, dhfr confers 
resistance to methotrexate; neo confers resistance to the aminoglycosides neomycin and G-418; 

and als or pat confer resistance to chlorsulfuron and phosphinothricin, 
respectively. (See, e.g., Wigler, M. et al. (1980) Proc. Natl. Acad. Sci. 77:3567-3570; 
Colbere-Garapin, F. et al. (1981) J. Mol. Biol. 150:1-14.) Additional selectable genes have been 
described, e.g., trpB and hisD, which alter cellular requirements for metabolites. (See, e.g., 
Hartman, S.C. and R.C. Mulligan (1988) Proc. Natl. Acad. Sci. 85:8047-8051.) Visible markers, 
e.g., anthocyanins, green fluorescent proteins (GFP; Clontech), β-glucuronidase and its substrate 
β-glucuronide, or luciferase and its substrate luciferin, may be used. These markers can be used 
not only to identify transformants, but also to quantify the amount of transient or stable protein 
expression attributable to a specific vector system. (See, e.g., Rhodes, C.A. (1995) Methods Mol. 
Biol. 55:121-131.) 

Although the presence/absence of marker gene expression suggests that the gene of 
interest is also present, the presence and expression of the gene may need to be confirmed. For 
example, if the sequence encoding RNAAP is inserted within a marker gene sequence, 
transformed cells containing sequences encoding RNAAP can be identified by the absence of 
marker gene function. Alternatively, a marker gene can be placed in tandem with a sequence 
encoding RNAAP under the control of a single promoter. Expression of the marker gene in 
response to induction or selection usually indicates expression of the tandem gene as well. 

In general, host cells that contain the nucleic acid sequence encoding RNAAP and that 
express RNAAP may be identified by a variety of procedures known to those of skill in the art. 
These procedures include, but are not limited to, DNA-DNA or DNA-RNA hybridizations, PCR 
amplification, and protein bioassay or immunoassay techniques which include membrane, 
solution, or chip based technologies for the detection and/or quantification of nucleic acid or 
protein sequences. 

Immunological methods for detecting and measuring the expression of RNAAP using 
either specific polyclonal or monoclonal antibodies are known in the art. Examples of such 

techniques include enzyme-linked immunosorbent assays (ELISAs), radioimmunoassays (RIAs), 
and fluorescence activated cell sorting (FACS). A two-site, monoclonal-based immunoassay 
utilizing monoclonal antibodies reactive to two non-interfering epitopes on RNAAP is preferred, 
but a competitive binding assay may be employed. These and other assays are well known in the 
art. (See, e.g., Hampton, R. et al. (1990) Serological Methods, a Laboratory Manual, APS Press, 
St Paul MN, Sect. IV; Coligan, J. E. et al. (1997) Current Protocols in Immunology , Greene Pub. 
Associates and Wiley-Interscience, New York NY; and Pound, J.D. (1998) Immunochemical 
Protocols , Humana Press, Totowa NJ). 

A wide variety of labels and conjugation techniques are known by those skilled in the art 
and may be used in various nucleic acid and amino acid assays. Means for producing labeled 
hybridization or PCR probes for detecting sequences related to polynucleotides encoding RNAAP 
include oligolabeling, nick translation, end-labeling, or PCR amplification using a labeled 
nucleotide. Alternatively, the sequences encoding RNAAP, or any fragments thereof, may be 
cloned into a vector for the production of an mRNA probe. Such vectors are known in the art, are 
commercially available, and may be used to synthesize RNA probes in vitro by addition of an 
appropriate RNA polymerase such as T7, T3, or SP6 and labeled nucleotides. These procedures 
may be conducted using a variety of commercially available kits, such as those provided by 
Amersham Pharmacia Biotech, Promega (Madison WI), and US Biochemical. Suitable reporter 
molecules or labels which may be used for ease of detection include radionuclides, enzymes, 

fluorescent, chemiluminescent, or chromogenic agents, as well as substrates, cofactors, inhibitors, 
magnetic particles, and the like. 

Host cells transformed with nucleotide sequences encoding RNAAP may be cultured 
under conditions suitable for the expression and recovery of the protein from cell culture. The 
protein produced by a transformed cell may be secreted or retained intracellularly depending on 
the sequence and/or the vector used. As will be understood by those of skill in the art, expression 
vectors containing polynucleotides which encode RNAAP may be designed to contain signal 
sequences which direct secretion of RNAAP through a prokaryotic or eukaryotic cell membrane. 

In addition, a host cell strain may be chosen for its ability to modulate expression of the 
inserted sequences or to process the expressed protein in the desired fashion. Such modifications 
of the polypeptide include, but are not limited to, acetylation, carboxylation, glycosylation, 
phosphorylation, lipidation, and acylation. Post-translational processing which cleaves a "prepro" 
form of the protein may also be used to specify protein targeting, folding, and/or activity. 
Different host cells which have specific cellular machinery and characteristic mechanisms for 
post-translational activities (e.g., CHO, HeLa, MDCK, HEK293, and WI38) are available from 
the American Type Culture Collection (ATCC, Manassas, VA) and may be chosen to ensure the 
correct modification and processing of the foreign protein. 

In another embodiment of the invention, natural, modified, or recombinant nucleic acid 
sequences encoding RNAAP may be ligated to a heterologous sequence resulting in translation of 
a fusion protein in any of the aforementioned host systems. For example, a chimeric RNAAP 
protein containing a heterologous moiety that can be recognized by a commercially available 
antibody may facilitate the screening of peptide libraries for inhibitors of RNAAP activity. 
Heterologous protein and peptide moieties may also facilitate purification of fusion proteins using 
commercially available affinity matrices. Such moieties include, but are not limited to, glutathione 
S-transferase (GST), maltose binding protein (MBP), thioredoxin (Trx), calmodulin binding 
peptide (CBP), 6-His, FLAG, c-myc, and hemagglutinin (HA). GST, MBP, Trx, CBP, and 6-His 
enable purification of their cognate fusion proteins on immobilized glutathione, maltose, 
phenylarsine oxide, calmodulin, and metal-chelate resins, respectively. FLAG, c-myc, and 
hemagglutinin (HA) enable immunoaffinity purification of fusion proteins using commercially 
available monoclonal and polyclonal antibodies that specifically recognize these epitope tags. A 
fusion protein may also be engineered to contain a proteolytic cleavage site located between the 
RNAAP encoding sequence and the heterologous protein sequence, so that RNAAP may be 
cleaved away from the heterologous moiety following purification. Methods for fusion protein 
expression and purification are discussed in Ausubel (1995, supra, ch. 10). A variety of 
commercially available kits may also be used to facilitate expression and purification of fusion 
proteins. 

In a further embodiment of the invention, synthesis of radiolabeled RNAAP may be 
achieved in vitro using the TNT rabbit reticulocyte lysate or wheat germ extract systems 
(Promega). These systems couple transcription and translation of protein-coding sequences 
operably associated with the T7, T3, or SP6 promoters. Translation takes place in the presence of 
a radiolabeled amino acid precursor, preferably 35S-methionine. 

Fragments of RNAAP may be produced not only by recombinant production, but also by 
direct peptide synthesis using solid-phase techniques. (See, e.g., Creighton, supra, pp. 55-60.) 
Protein synthesis may be performed by manual techniques or by automation. Automated synthesis 
may be achieved, for example, using the ABI 431A Peptide Synthesizer (Perkin-Elmer). Various 
fragments of RNAAP may be synthesized separately and then combined to produce the full length 
molecule. 

THERAPEUTICS 

Chemical and structural similarity, e.g., in the context of sequences and motifs, exists 
between regions of RNAAP and RNA-associated proteins. In addition, the expression of RNAAP 
is closely associated with cancer, fetal development, cell proliferation, inflammation, and immune 
response. Therefore, RNAAP appears to play a role in cell proliferative, 
autoimmune/inflammatory, and infectious disorders. In the treatment of disorders associated with 
increased RNAAP expression or activity, it is desirable to decrease the expression or activity of 
RNAAP. In the treatment of the above conditions associated with decreased RNAAP expression 
or activity, it is desirable to increase the expression or activity of RNAAP. 

Therefore, in one embodiment, RNAAP or a fragment or derivative thereof may be 
administered to a subject to treat or prevent a disorder associated with decreased expression or 
activity of RNAAP. Examples of such disorders include, but are not limited to, a cell proliferative 
disorder such as actinic keratosis, arteriosclerosis, atherosclerosis, bursitis, cirrhosis, hepatitis, 
mixed connective tissue disease (MCTD), myelofibrosis, paroxysmal nocturnal hemoglobinuria, 
polycythemia vera, psoriasis, primary thrombocythemia, and cancers including adenocarcinoma, 
leukemia, lymphoma, melanoma, myeloma, sarcoma, teratocarcinoma, and, in particular, cancers 
of the adrenal gland, bladder, bone, bone marrow, brain, breast, cervix, gall bladder, ganglia, 
gastrointestinal tract, heart, kidney, liver, lung, muscle, ovary, pancreas, parathyroid, penis, 
prostate, salivary glands, skin, spleen, testis, thymus, thyroid, and uterus; an 
autoimmune/inflammatory disorder such as acquired immunodeficiency syndrome (AIDS), 
Addison's disease, adult respiratory distress syndrome, allergies, ankylosing spondylitis, 
amyloidosis, anemia, asthma, atherosclerosis, autoimmune hemolytic anemia, autoimmune 
thyroiditis, autoimmune polyendocrinopathy-candidiasis-ectodermal dystrophy (APECED), 
bronchitis, cholecystitis, contact dermatitis, Crohn's disease, atopic dermatitis, dermatomyositis, 
diabetes mellitus, emphysema, episodic lymphopenia with lymphocytotoxins, erythroblastosis 
fetalis, erythema nodosum, atrophic gastritis, glomerulonephritis, Goodpasture's syndrome, gout, 
Graves' disease, Hashimoto's thyroiditis, hypereosinophilia, irritable bowel syndrome, multiple 
sclerosis, myasthenia gravis, myocardial or pericardial inflammation, osteoarthritis, osteoporosis, 
pancreatitis, polymyositis, psoriasis, Reiter's syndrome, rheumatoid arthritis, scleroderma, 
Sjogren's syndrome, systemic anaphylaxis, systemic lupus erythematosus, systemic sclerosis, 
thrombocytopenic purpura, ulcerative colitis, uveitis, Werner syndrome, complications of cancer, 
hemodialysis, and extracorporeal circulation, viral, bacterial, fungal, parasitic, protozoal, and 
helminthic infections, and trauma; and an infectious disorder such as infections by viral agents 
classified as adenovirus, arenavirus, bunyavirus, calicivirus, coronavirus, filovirus, hepadnavirus, 
herpesvirus, flavivirus, orthomyxovirus, parvovirus, papovavirus, paramyxovirus, picornavirus, 
poxvirus, reovirus, retrovirus, rhabdovirus, and togavirus; infections by bacterial agents classified 
as pneumococcus, staphylococcus, streptococcus, bacillus, corynebacterium, clostridium, 
meningococcus, gonococcus, listeria, moraxella, kingella, haemophilus, legionella, bordetella, 
gram-negative enterobacterium including shigella, salmonella, and campylobacter, pseudomonas, 
vibrio, brucella, francisella, yersinia, bartonella, nocardia, actinomyces, mycobacterium, 
spirochaetale, rickettsia, chlamydia, and mycoplasma; infections by fungal agents classified as 
aspergillus, blastomyces, dermatophytes, cryptococcus, coccidioides, malassezia, histoplasma, and 
other fungal agents causing various mycoses; and infections by parasites classified as plasmodium 
or malaria-causing, parasitic entamoeba, leishmania, trypanosoma, toxoplasma, Pneumocystis 
carinii, intestinal protozoa such as giardia, trichomonas, tissue nematodes such as trichinella, 
intestinal nematodes such as ascaris, lymphatic filarial nematodes, trematodes such as 
schistosoma, and cestodes (tapeworm). 

In another embodiment, a vector capable of expressing RNAAP or a fragment or 
derivative thereof may be administered to a subject to treat or prevent a disorder associated with 
decreased expression or activity of RNAAP including, but not limited to, those described above. 

In a further embodiment, a pharmaceutical composition comprising a substantially 
purified RNAAP in conjunction with a suitable pharmaceutical carrier may be administered to a 
subject to treat or prevent a disorder associated with decreased expression or activity of RNAAP 
including, but not limited to, those provided above. 

In still another embodiment, an agonist which modulates the activity of RNAAP may be 
administered to a subject to treat or prevent a disorder associated with decreased expression or 
activity of RNAAP including, but not limited to, those listed above. 

In a further embodiment, an antagonist of RNAAP may be administered to a subject to 
treat or prevent a disorder associated with increased expression or activity of RNAAP. Examples 
of such disorders include, but are not limited to, those cell proliferative, 
autoimmune/inflammatory, and infectious disorders described above. In one aspect, an antibody 
which specifically binds RNAAP may be used directly as an antagonist or indirectly as a targeting 
or delivery mechanism for bringing a pharmaceutical agent to cells or tissue which express 
RNAAP. 

In an additional embodiment, a vector expressing the complement of the polynucleotide 
encoding RNAAP may be administered to a subject to treat or prevent a disorder associated with 
increased expression or activity of RNAAP including, but not limited to, those described above. 

In other embodiments, any of the proteins, antagonists, antibodies, agonists, 
complementary sequences, or vectors of the invention may be administered in combination with 
other appropriate therapeutic agents. Selection of the appropriate agents for use in combination 
therapy may be made by one of ordinary skill in the art, according to conventional pharmaceutical 
principles. The combination of therapeutic agents may act synergistically to effect the treatment 
or prevention of the various disorders described above. Using this approach, one may be able to 
achieve therapeutic efficacy with lower dosages of each agent, thus reducing the potential for 
adverse side effects. 

An antagonist of RNAAP may be produced using methods which are generally known in 
the art. In particular, purified RNAAP may be used to produce antibodies or to screen libraries of 
pharmaceutical agents to identify those which specifically bind RNAAP. Antibodies to RNAAP 
may also be generated using methods that are well known in the art. Such antibodies may include, 
but are not limited to, polyclonal, monoclonal, chimeric, and single chain antibodies, Fab 
fragments, and fragments produced by a Fab expression library. Neutralizing antibodies (i.e., 
those which inhibit dimer formation) are especially preferred for therapeutic use. 

For the production of antibodies, various hosts including goats, rabbits, rats, mice, 
humans, and others may be immunized by injection with RNAAP or with any fragment or 
oligopeptide thereof which has immunogenic properties. Depending on the host species, various 
adjuvants may be used to increase immunological response. Such adjuvants include, but are not 
limited to, Freund's, mineral gels such as aluminum hydroxide, and surface active substances such 
as lysolecithin, pluronic polyols, polyanions, peptides, oil emulsions, KLH, and dinitrophenol. 
Among adjuvants used in humans, BCG (bacilli Calmette-Guerin) and Corynebacterium parvum 
are especially preferable. 

It is preferred that the oligopeptides, peptides, or fragments used to induce antibodies to 
RNAAP have an amino acid sequence consisting of at least about 5 amino acids, and, more 
preferably, of at least about 10 amino acids. It is also preferable that these oligopeptides, peptides, 
or fragments are identical to a portion of the amino acid sequence of the natural protein and 
contain the entire amino acid sequence of a small, naturally occurring molecule. Short stretches of 
RNAAP amino acids may be fused with those of another protein, such as KLH, and antibodies to 
the chimeric molecule may be produced. 

Monoclonal antibodies to RNAAP may be prepared using any technique which provides 
for the production of antibody molecules by continuous cell lines in culture. These include, but 
are not limited to, the hybridoma technique, the human B-cell hybridoma technique, and the 
EBV-hybridoma technique. (See, e.g., Kohler, G. et al. (1975) Nature 256:495-497; Kozbor, D. et al. 
(1985) J. Immunol. Methods 81:31-42; Cote, R.J. et al. (1983) Proc. Natl. Acad. Sci. 
80:2026-2030; and Cole, S.P. et al. (1984) Mol. Cell Biol. 62:109-120.) 

In addition, techniques developed for the production of "chimeric antibodies," such as the 
splicing of mouse antibody genes to human antibody genes to obtain a molecule with appropriate 
antigen specificity and biological activity, can be used. (See, e.g., Morrison, S.L. et al. (1984) 
Proc. Natl. Acad. Sci. 81:6851-6855; Neuberger, M.S. et al. (1984) Nature 312:604-608; and 
Takeda, S. et al. (1985) Nature 314:452-454.) Alternatively, techniques described for the 
production of single chain antibodies may be adapted, using methods known in the art, to produce 
RNAAP-specific single chain antibodies. Antibodies with related specificity, but of distinct 
idiotypic composition, may be generated by chain shuffling from random combinatorial 
immunoglobulin libraries. (See, e.g., Burton D.R. (1991) Proc. Natl. Acad. Sci. 88:10134-10137.) 

Antibodies may also be produced by inducing in vivo production in the lymphocyte 
population or by screening immunoglobulin libraries or panels of highly specific binding reagents 
as disclosed in the literature. (See, e.g., Orlandi, R. et al. (1989) Proc. Natl. Acad. Sci. 
86:3833-3837; Winter, G. et al. (1991) Nature 349:293-299.) 

Antibody fragments which contain specific binding sites for RNAAP may also be 
generated. For example, such fragments include, but are not limited to, F(ab')2 fragments 
produced by pepsin digestion of the antibody molecule and Fab fragments generated by reducing 
the disulfide bridges of the F(ab')2 fragments. Alternatively, Fab expression libraries may be 
constructed to allow rapid and easy identification of monoclonal Fab fragments with the desired 
specificity. (See, e.g., Huse, W.D. et al. (1989) Science 246:1275-1281.) 

Various immunoassays may be used for screening to identify antibodies having the 
desired specificity. Numerous protocols for competitive binding or immunoradiometric assays 
using either polyclonal or monoclonal antibodies with established specificities are well known in 
the art. Such immunoassays typically involve the measurement of complex formation between 
RNAAP and its specific antibody. A two-site, monoclonal-based immunoassay utilizing 
monoclonal antibodies reactive to two non-interfering RNAAP epitopes is preferred, but a 
competitive binding assay may also be employed (Pound, supra ). 

Various methods such as Scatchard analysis in conjunction with radioimmunoassay 
techniques may be used to assess the affinity of antibodies for RNAAP. Affinity is expressed as 
an association constant, Ka, which is defined as the molar concentration of RNAAP-antibody 
complex divided by the molar concentrations of free antigen and free antibody under equilibrium 
conditions. The Ka determined for a preparation of polyclonal antibodies, which are 
heterogeneous in their affinities for multiple RNAAP epitopes, represents the average affinity, or 
avidity, of the antibodies for RNAAP. The Ka determined for a preparation of monoclonal 
antibodies, which are monospecific for a particular RNAAP epitope, represents a true measure of 

affinity. High-affinity antibody preparations with Ka ranging from about 10^9 to 10^12 L/mole are 
preferred for use in immunoassays in which the RNAAP-antibody complex must withstand 
rigorous manipulations. Low-affinity antibody preparations with Ka ranging from about 10^6 to 10^7 
L/mole are preferred for use in immunopurification and similar procedures which ultimately 
require dissociation of RNAAP, preferably in active form, from the antibody (Catty, D. (1988) 

Antibodies, Volume I: A Practical Approach, IRL Press, Washington, DC; Liddell, J. E. and 
Cryer, A. (1991) A Practical Guide to Monoclonal Antibodies , John Wiley & Sons, New York 
NY). 
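
The definition of the association constant given above, and its estimation by Scatchard analysis, can be sketched in Python as follows. This is a minimal illustration under stated assumptions: the function names are hypothetical, a single class of binding sites is assumed, and the Scatchard estimate is taken as the negative slope of an ordinary least-squares line through the points (bound, bound/free). None of these details is specified in this disclosure.

```python
def association_constant(complex_molar, free_antigen_molar, free_antibody_molar):
    """Ka = [complex] / ([free antigen] * [free antibody]), in L/mole."""
    if free_antigen_molar <= 0 or free_antibody_molar <= 0:
        raise ValueError("free concentrations must be positive")
    return complex_molar / (free_antigen_molar * free_antibody_molar)


def scatchard_ka(bound, free):
    """Estimate Ka from paired bound and free antigen concentrations (mol/L).

    For a single class of sites, bound/free = Ka * (Bmax - bound), so the
    slope of bound/free plotted against bound is -Ka.
    """
    xs = list(bound)
    ys = [b / f for b, f in zip(bound, free)]
    if len(xs) < 2 or len(xs) != len(ys):
        raise ValueError("need at least two paired measurements")
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    return -sxy / sxx
```

For a polyclonal preparation the Scatchard plot is generally not a single straight line, so a value obtained this way would correspond to the average affinity described above rather than to a true single-site constant.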

The titer and avidity of polyclonal antibody preparations may be further evaluated to 
determine the quality and suitability of such preparations for certain downstream applications. For 
example, a polyclonal antibody preparation containing at least 1-2 mg specific antibody/ml, 
preferably 5-10 mg specific antibody/ml, is preferred for use in procedures requiring precipitation 
of RNAAP-antibody complexes. Procedures for evaluating antibody specificity, titer, and avidity, 
and guidelines for antibody quality and usage in various applications, are generally available. 
(See, e.g., Catty, supra, and Coligan et al., supra.) 

In another embodiment of the invention, the polynucleotides encoding RNAAP, or any 
fragment or complement thereof, may be used for therapeutic purposes. In one aspect, the 
complement of the polynucleotide encoding RNAAP may be used in situations in which it would 
be desirable to block the transcription of the mRNA. In particular, cells may be transformed with 
sequences complementary to polynucleotides encoding RNAAP. Thus, complementary molecules 
or fragments may be used to modulate RNAAP activity, or to achieve regulation of gene function. 
Such technology is now well known in the art, and sense or antisense oligonucleotides or larger 
fragments can be designed from various locations along the coding or control regions of sequences 
encoding RNAAP. 

Expression vectors derived from retroviruses, adenoviruses, or herpes or vaccinia viruses, 
or from various bacterial plasmids, may be used for delivery of nucleotide sequences to the 
targeted organ, tissue, or cell population. Methods which are well known to those skilled in the art 
can be used to construct vectors to express nucleic acid sequences complementary to the 
polynucleotides encoding RNAAP. (See, e.g., Sambrook, supra ; Ausubel, 1995, supra .) 

Genes encoding RNAAP can be turned off by transforming a cell or tissue with expression 
vectors which express high levels of a polynucleotide, or fragment thereof, encoding RNAAP. 
Such constructs may be used to introduce untranslatable sense or antisense sequences into a cell. 
Even in the absence of integration into the DNA, such vectors may continue to transcribe RNA 
molecules until they are disabled by endogenous nucleases. Transient expression may last for a 
month or more with a non-replicating vector, and may last even longer if appropriate replication 
elements are part of the vector system. 

As mentioned above, modifications of gene expression can be obtained by designing 
complementary sequences or antisense molecules (DNA, RNA, or PNA) to the control, 5', or 
regulatory regions of the gene encoding RNAAP. Oligonucleotides derived from the transcription
initiation site, e.g., between about positions -10 and +10 from the start site, are preferred.
Similarly, inhibition can be achieved using triple helix base-pairing methodology. Triple helix
pairing is useful because it inhibits the ability of the double helix to open sufficiently
for the binding of polymerases, transcription factors, or regulatory molecules. Recent therapeutic
advances using triplex DNA have been described in the literature. (See, e.g., Gee, J.E. et al.
(1994) in Huber, B.E. and B.I. Carr, Molecular and Immunologic Approaches, Futura Publishing,
Mt. Kisco NY, pp. 163-177.) A complementary sequence or antisense molecule may also be
designed to block translation of mRNA by preventing the transcript from binding to ribosomes. 
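
The oligonucleotide design step described above can be sketched as follows. This is a minimal illustration only: the function names, the example sequence, and the coordinate convention (no position 0, so that -1 is immediately followed by +1) are assumptions made for the sketch and are not part of the disclosure.

```python
# Illustrative sketch only: names and the example sequence are hypothetical.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna: str) -> str:
    """Return the reverse complement of a DNA sequence, written 5'->3'."""
    return dna.upper().translate(COMPLEMENT)[::-1]


def antisense_around_start(sense: str, start_index: int,
                           upstream: int = 10, downstream: int = 10) -> str:
    """Antisense oligo covering positions -upstream..+downstream.

    `start_index` is the 0-based index of position +1 in `sense`.
    Assumes there is no position 0, so the window spans exactly
    upstream + downstream bases.
    """
    lo = start_index - upstream
    hi = start_index + downstream
    if lo < 0 or hi > len(sense):
        raise ValueError("window extends past the supplied sequence")
    return reverse_complement(sense[lo:hi])


# Made-up sense-strand sequence; position +1 is assumed to be at index 20.
example = "GGCCTATAAAGGCGCGCCAGTCATGGCTAGCTAGGATCC"
oligo = antisense_around_start(example, start_index=20)
```

The resulting oligonucleotide is complementary to the selected window, so taking its reverse complement recovers the original sense-strand segment.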

Ribozymes, enzymatic RNA molecules, may also be used to catalyze the specific cleavage 
of RNA. The mechanism of ribozyme action involves sequence-specific hybridization of the
ribozyme molecule to complementary target RNA, followed by endonucleolytic cleavage. For
example, engineered hammerhead motif ribozyme molecules may specifically and efficiently 
catalyze endonucleolytic cleavage of sequences encoding RNAAP. 

Specific ribozyme cleavage sites within any potential RNA target are initially identified by
scanning the target molecule for ribozyme cleavage sites, including the following sequences:
GUA, GUU, and GUC. Once identified, short RNA sequences of between 15 and 20
ribonucleotides, corresponding to the region of the target gene containing the cleavage site, may
be evaluated for secondary structural features which may render the oligonucleotide inoperable.
The suitability of candidate targets may also be evaluated by testing accessibility to hybridization
with complementary oligonucleotides using ribonuclease protection assays.
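
The scanning step above can be sketched as follows. This is a hedged illustration: the function names, the default window length of 17, and the choice to roughly centre each window on the cleavage triplet are assumptions. The text specifies only the triplets GUA, GUU, and GUC and a length of 15 to 20 ribonucleotides. Secondary-structure evaluation and accessibility testing are not modelled.

```python
# Illustrative sketch only: function names and window policy are assumptions.
CLEAVAGE_TRIPLETS = ("GUA", "GUU", "GUC")


def _as_rna(seq: str) -> str:
    """Normalise input to upper-case RNA (T is read as U)."""
    return seq.upper().replace("T", "U")


def find_cleavage_sites(seq: str) -> list:
    """Return (0-based position, triplet) for every GUA/GUU/GUC in the target."""
    rna = _as_rna(seq)
    return [(i, rna[i:i + 3])
            for i in range(len(rna) - 2)
            if rna[i:i + 3] in CLEAVAGE_TRIPLETS]


def candidate_windows(seq: str, length: int = 17) -> list:
    """Return (position, triplet, window) with each window containing its site."""
    if not 15 <= length <= 20:
        raise ValueError("window length must be between 15 and 20")
    rna = _as_rna(seq)
    if len(rna) < length:
        return []
    windows = []
    for pos, triplet in find_cleavage_sites(rna):
        # Roughly centre the triplet, then clamp the window to the sequence ends.
        start = pos + 1 - length // 2
        start = max(0, min(start, len(rna) - length))
        windows.append((pos, triplet, rna[start:start + length]))
    return windows
```

Each returned window would then be a candidate for the structural and accessibility checks described in the text.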

Complementary ribonucleic acid molecules and ribozymes of the invention may be 
prepared by any method known in the art for the synthesis of nucleic acid molecules. These 
include techniques for chemically synthesizing oligonucleotides such as solid phase 
phosphoramidite chemical synthesis. Alternatively, RNA molecules may be generated by in vitro
and in vivo transcription of DNA sequences encoding RNAAP. Such DNA sequences may be
incorporated into a wide variety of vectors with suitable RNA polymerase promoters such as T7 or
SP6. Alternatively, these cDNA constructs that synthesize complementary RNA, constitutively or 
inducibly, can be introduced into cell lines, cells, or tissues. 

RNA molecules may be modified to increase intracellular stability and half-life. Possible 
modifications include, but are not limited to, the addition of flanking sequences at the 5' and/or 3'
ends of the molecule, or the use of phosphorothioate or 2' O-methyl rather than phosphodiester
linkages within the backbone of the molecule. This concept is inherent in the production of PNAs
and can be extended in all of these molecules by the inclusion of nontraditional bases such as
inosine, queuosine, and wybutosine, as well as acetyl-, methyl-, thio-, and similarly modified forms
of adenine, cytidine, guanine, thymine, and uridine which are not as easily recognized by
endogenous endonucleases.

Many methods for introducing vectors into cells or tissues are available and equally 
suitable for use in vivo, in vitro, and ex vivo. For ex vivo therapy, vectors may be introduced into
stem cells taken from the patient and clonally propagated for autologous transplant back into that
same patient. Delivery by transfection, by liposome injections, or by polycationic amino polymers
may be achieved using methods which are well known in the art. (See, e.g., Goldman, C.K. et al. 
(1997) Nature Biotechnology 15:462-466.) 

Any of the therapeutic methods described above may be applied to any subject in need of 
such therapy, including, for example, mammals such as dogs, cats, cows, horses, rabbits,
monkeys, and most preferably, humans.

An additional embodiment of the invention relates to the administration of a 
pharmaceutical or sterile composition, in conjunction with a pharmaceutically acceptable carrier,
for any of the therapeutic effects discussed above. Such pharmaceutical compositions may consist
of RNAAP, antibodies to RNAAP, and mimetics, agonists, antagonists, or inhibitors of RNAAP.
The compositions may be administered alone or in combination with at least one other agent, such
as a stabilizing compound, which may be administered in any sterile, biocompatible
pharmaceutical carrier including, but not limited to, saline, buffered saline, dextrose, and water.
The compositions may be administered to a patient alone, or in combination with other agents,
drugs, or hormones.

The pharmaceutical compositions utilized in this invention may be administered by any 
number of routes including, but not limited to, oral, intravenous, intramuscular, intra-arterial, 
intramedullary, intrathecal, intraventricular, transdermal, subcutaneous, intraperitoneal, intranasal, 
enteral, topical, sublingual, or rectal means. 

In addition to the active ingredients, these pharmaceutical compositions may contain
suitable pharmaceutically acceptable carriers comprising excipients and auxiliaries which
facilitate processing of the active compounds into preparations which can be used
pharmaceutically. Further details on techniques for formulation and administration may be found
in the latest edition of Remington's Pharmaceutical Sciences (Mack Publishing, Easton PA).

Pharmaceutical compositions for oral administration can be formulated using
pharmaceutically acceptable carriers well known in the art in dosages suitable for oral
administration. Such carriers enable the pharmaceutical compositions to be formulated as tablets, 
pills, dragees, capsules, liquids, gels, syrups, slurries, suspensions, and the like, for ingestion by 
the patient. 

Pharmaceutical preparations for oral use can be obtained through combining active
compounds with solid excipient and processing the resultant mixture of granules (optionally, after
grinding) to obtain tablets or dragee cores. Suitable auxiliaries can be added, if desired. Suitable 
excipients include carbohydrate or protein fillers, such as sugars, including lactose, sucrose, 
mannitol, and sorbitol; starch from corn, wheat, rice, potato, or other plants; cellulose, such as
methyl cellulose, hydroxypropylmethyl-cellulose, or sodium carboxymethylcellulose; gums,
including arabic and tragacanth; and proteins, such as gelatin and collagen. If desired,
disintegrating or solubilizing agents may be added, such as the cross-linked polyvinyl pyrrolidone,
agar, and alginic acid or a salt thereof, such as sodium alginate. 

Dragee cores may be used in conjunction with suitable coatings, such as concentrated 
sugar solutions, which may also contain gum arabic, talc, polyvinylpyrrolidone, carbopol gel,
polyethylene glycol, and/or titanium dioxide, lacquer solutions, and suitable organic solvents or
solvent mixtures. Dyestuffs or pigments may be added to the tablets or dragee coatings for
product identification or to characterize the quantity of active compound, i.e., dosage.

Pharmaceutical preparations which can be used orally include push-fit capsules made of
gelatin, as well as soft, sealed capsules made of gelatin and a coating, such as glycerol or sorbitol.
Push-fit capsules can contain active ingredients mixed with fillers or binders, such as lactose or
starches, lubricants, such as talc or magnesium stearate, and, optionally, stabilizers. In soft
capsules, the active compounds may be dissolved or suspended in suitable liquids, such as fatty
oils, liquid, or liquid polyethylene glycol with or without stabilizers.

Pharmaceutical formulations suitable for parenteral administration may be formulated in 
aqueous solutions, preferably in physiologically compatible buffers such as Hanks' solution,
Ringer's solution, or physiologically buffered saline. Aqueous injection suspensions may contain
substances which increase the viscosity of the suspension, such as sodium carboxymethyl
cellulose, sorbitol, or dextran. Additionally, suspensions of the active compounds may be
prepared as appropriate oily injection suspensions. Suitable lipophilic solvents or vehicles include
fatty oils, such as sesame oil, or synthetic fatty acid esters, such as ethyl oleate, triglycerides, or
liposomes. Non-lipid polycationic amino polymers may also be used for delivery. Optionally, the
suspension may also contain suitable stabilizers or agents to increase the solubility of the
compounds and allow for the preparation of highly concentrated solutions.

For topical or nasal administration, penetrants appropriate to the particular barrier to be 
permeated are used in the formulation. Such penetrants are generally known in the art. 

The pharmaceutical compositions of the present invention may be manufactured in a 
manner that is known in the art, e.g., by means of conventional mixing, dissolving, granulating, 
dragee-making, levigating, emulsifying, encapsulating, entrapping, or lyophilizing processes.

The pharmaceutical composition may be provided as a salt and can be formed with many 
acids, including but not limited to, hydrochloric, sulfuric, acetic, lactic, tartaric, malic, and 
succinic acid. Salts tend to be more soluble in aqueous or other protonic solvents than are the 
corresponding free base forms. In other cases, the preferred preparation may be a lyophilized
powder which may contain any or all of the following: 1 mM to 50 mM histidine, 0.1% to 2%
sucrose, and 2% to 7% mannitol, at a pH range of 4.5 to 5.5, that is combined with buffer prior to
use. 

After pharmaceutical compositions have been prepared, they can be placed in an 
appropriate container and labeled for treatment of an indicated condition. For administration of 
RNAAP, such labeling would include amount, frequency, and method of administration.

Pharmaceutical compositions suitable for use in the invention include compositions 
wherein the active ingredients are contained in an effective amount to achieve the intended 
purpose. The determination of an effective dose is well within the capability of those skilled in the
art.

For any compound, the therapeutically effective dose can be estimated initially either in 
cell culture assays, e.g., of neoplastic cells or in animal models such as mice, rats, rabbits, dogs, or 
pigs. An animal model may also be used to determine the appropriate concentration range and 
route of administration. Such information can then be used to determine useful doses and routes
for administration in humans.

A therapeutically effective dose refers to that amount of active ingredient, for example 
RNAAP or fragments thereof, antibodies of RNAAP, and agonists, antagonists or inhibitors of 
RNAAP, which ameliorates the symptoms or condition. Therapeutic efficacy and toxicity may be 
determined by standard pharmaceutical procedures in cell cultures or with experimental animals,
such as by calculating the ED50 (the dose therapeutically effective in 50% of the population) or
LD50 (the dose lethal to 50% of the population) statistics. The dose ratio of toxic to therapeutic
effects is the therapeutic index, which can be expressed as the LD50/ED50 ratio. Pharmaceutical
compositions which exhibit large therapeutic indices are preferred. The data obtained from cell
culture assays and animal studies are used to formulate a range of dosage for human use. The
dosage contained in such compositions is preferably within a range of circulating concentrations
that includes the ED50 with little or no toxicity. The dosage varies within this range depending
upon the dosage form employed, the sensitivity of the patient, and the route of administration. 
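
The therapeutic index defined above is a simple ratio, sketched here for concreteness. The function name and the numerical values are hypothetical and are not data from the disclosure. ED50 and LD50 are assumed to be expressed in the same units.

```python
# Illustrative sketch only: the values used below are made up.
def therapeutic_index(ld50: float, ed50: float) -> float:
    """Return LD50/ED50; both doses must be positive and in the same units."""
    if ld50 <= 0 or ed50 <= 0:
        raise ValueError("LD50 and ED50 must be positive")
    return ld50 / ed50


# Hypothetical example: LD50 of 500 mg/kg and ED50 of 25 mg/kg.
index = therapeutic_index(ld50=500.0, ed50=25.0)
```

A larger ratio indicates a wider margin between the effective and the lethal dose, which is why compositions with large therapeutic indices are preferred.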

The exact dosage will be determined by the practitioner, in light of factors related to the 
subject requiring treatment. Dosage and administration are adjusted to provide sufficient levels of
the active moiety or to maintain the desired effect. Factors which may be taken into account
include the severity of the disease state, the general health of the subject, the age, weight, and
gender of the subject, time and frequency of administration, drug combination(s), reaction
sensitivities, and response to therapy. Long-acting pharmaceutical compositions may be
administered every 3 to 4 days, every week, or biweekly depending on the half-life and clearance
rate of the particular formulation.

Normal dosage amounts may vary from about 0.1 μg to 100,000 μg, up to a total dose of
about 1 gram, depending upon the route of administration. Guidance as to particular dosages and
methods of delivery is provided in the literature and generally available to practitioners in the art.
Those skilled in the art will employ different formulations for nucleotides than for proteins or their
inhibitors. Similarly, delivery of polynucleotides or polypeptides will be specific to particular
cells, conditions, locations, etc.

DIAGNOSTICS 

In another embodiment, antibodies which specifically bind RNAAP may be used for the
diagnosis of disorders characterized by expression of RNAAP, or in assays to monitor patients
being treated with RNAAP or agonists, antagonists, or inhibitors of RNAAP. Antibodies useful
for diagnostic purposes may be prepared in the same manner as described above for therapeutics.
Diagnostic assays for RNAAP include methods which utilize the antibody and a label to detect
RNAAP in human body fluids or in extracts of cells or tissues. The antibodies may be used with
or without modification, and may be labeled by covalent or non-covalent attachment of a reporter
molecule. A wide variety of reporter molecules, several of which are described above, are known 
in the art and may be used. 

A variety of protocols for measuring RNAAP, including ELISAs, RIAs, and FACS, are 
known in the art and provide a basis for diagnosing altered or abnormal levels of RNAAP
expression. Normal or standard values for RNAAP expression are established by combining body
fluids or cell extracts taken from normal mammalian subjects, preferably human, with antibody to
RNAAP under conditions suitable for complex formation. The amount of standard complex
formation may be quantitated by various methods, preferably by photometric means. Quantities of
RNAAP expressed in subject, control, and disease samples from biopsied tissues are compared
with the standard values. Deviation between standard and subject values establishes the
parameters for diagnosing disease. 

In another embodiment of the invention, the polynucleotides encoding RNAAP may be 
used for diagnostic purposes. The polynucleotides which may be used include oligonucleotide 
sequences, complementary RNA and DNA molecules, and PNAs. The polynucleotides may be
used to detect and quantitate gene expression in biopsied tissues in which expression of RNAAP
may be correlated with disease. The diagnostic assay may be used to determine absence, 
presence, and excess expression of RNAAP, and to monitor regulation of RNAAP levels during 
therapeutic intervention. 

In one aspect, hybridization with PCR probes which are capable of detecting
polynucleotide sequences, including genomic sequences, encoding RNAAP or closely related
molecules may be used to identify nucleic acid sequences which encode RNAAP. The specificity
of the probe, whether it is made from a highly specific region, e.g., the 5' regulatory region, or
from a less specific region, e.g., a conserved motif, and the stringency of the hybridization or
amplification (maximal, high, intermediate, or low), will determine whether the probe identifies
only naturally occurring sequences encoding RNAAP, allelic variants, or related sequences.

Probes may also be used for the detection of related sequences, and should preferably 
have at least 50% sequence identity to any of the RNAAP encoding sequences. The hybridization 
probes of the subject invention may be DNA or RNA and may be derived from the sequence of 
SEQ ID NO:26-50 or from genomic sequences including promoters, enhancers, and introns of the
RNAAP gene. 
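
The 50% sequence identity criterion mentioned above can be illustrated with a simple calculation. This is a sketch under stated assumptions: the two sequences are taken to be already aligned and of equal length, identity is counted per aligned column, and the function names and example sequences are hypothetical. The text does not prescribe a particular alignment method, and in practice an alignment program would produce the aligned pair.

```python
# Illustrative sketch only: assumes pre-aligned sequences of equal length.
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percentage of aligned columns holding the same base (gaps never match)."""
    if not aligned_a or len(aligned_a) != len(aligned_b):
        raise ValueError("sequences must be non-empty and of equal aligned length")
    matches = sum(
        1 for x, y in zip(aligned_a.upper(), aligned_b.upper())
        if x == y and x != "-"
    )
    return 100.0 * matches / len(aligned_a)


def meets_identity_threshold(aligned_a: str, aligned_b: str,
                             threshold: float = 50.0) -> bool:
    """True when identity is at least `threshold` percent."""
    return percent_identity(aligned_a, aligned_b) >= threshold


# Made-up probe and target fragments for illustration.
identity = percent_identity("ACGTACGT", "ACGTTTTT")
```

In this made-up pair, five of the eight aligned positions match, so the probe would pass a 50% threshold.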

Means for producing specific hybridization probes for DNAs encoding RNAAP include 
the cloning of polynucleotide sequences encoding RNAAP or RNAAP derivatives into vectors for 
the production of mRNA probes. Such vectors are known in the art, are commercially available,
and may be used to synthesize RNA probes in vitro by means of the addition of the appropriate
RNA polymerases and the appropriate labeled nucleotides. Hybridization probes may be labeled
by a variety of reporter groups, for example, by radionuclides such as 32P or 35S, or by enzymatic
labels, such as alkaline phosphatase coupled to the probe via avidin/biotin coupling systems, and 
the like. 

Polynucleotide sequences encoding RNAAP may be used for the diagnosis of disorders
associated with expression of RNAAP. Examples of such disorders include, but are not limited to,
a cell proliferative disorder, such as actinic keratosis, arteriosclerosis, atherosclerosis, bursitis, 
cirrhosis, hepatitis, mixed connective tissue disease (MCTD), myelofibrosis, paroxysmal nocturnal 
hemoglobinuria, polycythemia vera, psoriasis, primary thrombocythemia, and cancers including
adenocarcinoma, leukemia, lymphoma, melanoma, myeloma, sarcoma, teratocarcinoma, and, in
particular, cancers of the adrenal gland, bladder, bone, bone marrow, brain, breast, cervix, gall 
bladder, ganglia, gastrointestinal tract, heart, kidney, liver, lung, muscle, ovary, pancreas, 
parathyroid, penis, prostate, salivary glands, skin, spleen, testis, thymus, thyroid, and uterus; an 
autoimmune/inflammatory disorder such as acquired immunodeficiency syndrome (AIDS),
Addison's disease, adult respiratory distress syndrome, allergies, ankylosing spondylitis,
amyloidosis, anemia, asthma, atherosclerosis, autoimmune hemolytic anemia, autoimmune
thyroiditis, autoimmune polyendocrinopathy-candidiasis-ectodermal dystrophy (APECED),
bronchitis, cholecystitis, contact dermatitis, Crohn's disease, atopic dermatitis, dermatomyositis, 
diabetes mellitus, emphysema, episodic lymphopenia with lymphocytotoxins, erythroblastosis
fetalis, erythema nodosum, atrophic gastritis, glomerulonephritis, Goodpasture's syndrome, gout,
Graves' disease, Hashimoto's thyroiditis, hypereosinophilia, irritable bowel syndrome, multiple
sclerosis, myasthenia gravis, myocardial or pericardial inflammation, osteoarthritis, osteoporosis,
pancreatitis, polymyositis, psoriasis, Reiter's syndrome, rheumatoid arthritis, scleroderma,
Sjogren's syndrome, systemic anaphylaxis, systemic lupus erythematosus, systemic sclerosis,
thrombocytopenic purpura, ulcerative colitis, uveitis, Werner syndrome, complications of cancer,
hemodialysis, and extracorporeal circulation, viral, bacterial, fungal, parasitic, protozoal, and 
helminthic infections, and trauma; and an infectious disorder such as infections by viral agents 
classified as adenovirus, arenavirus, bunyavirus, calicivirus, coronavirus, filovirus, hepadnavirus, 
herpesvirus, flavivirus, orthomyxovirus, parvovirus, papovavirus, paramyxovirus, picornavirus,
poxvirus, reovirus, retrovirus, rhabdovirus, and togavirus; infections by bacterial agents classified 
as pneumococcus, staphylococcus, streptococcus, bacillus, corynebacterium, Clostridium, 
meningococcus, gonococcus, listeria, moraxella, kingella, haemophilus, legionella, bordetella,
gram-negative enterobacterium including shigella, salmonella, and Campylobacter, pseudomonas,
vibrio, brucella, francisella, yersinia, bartonella, norcardium, actinomyces, mycobacterium,
spirochaetale, rickettsia, chlamydia, and mycoplasma; infections by fungal agents classified as
aspergillus, blastomyces, dermatophytes, cryptococcus, coccidioides, malasezzia, histoplasma, and
other fungal agents causing various mycoses; and infections by parasites classified as plasmodium 
or malaria-causing, parasitic entamoeba, leishmania, trypanosoma, toxoplasma, Pneumocystis
carinii, intestinal protozoa such as giardia, trichomonas, tissue nematodes such as trichinella,
intestinal nematodes such as ascaris, lymphatic filarial nematodes, trematodes such as 
schistosoma, and cestodes (tapeworm). The polynucleotide sequences encoding RNAAP may be 
used in Southern or northern analysis, dot blot, or other membrane-based technologies; in PCR 
technologies; in dipstick, pin, and multiformat ELISA-like assays; and in microarrays utilizing
fluids or tissues from patients to detect altered RNAAP expression. Such qualitative or
quantitative methods are well known in the art. 

In a particular aspect, the nucleotide sequences encoding RNAAP may be useful in assays 
that detect the presence of associated disorders, particularly those mentioned above. The 
nucleotide sequences encoding RNAAP may be labeled by standard methods and added to a fluid
or tissue sample from a patient under conditions suitable for the formation of hybridization
complexes. After a suitable incubation period, the sample is washed and the signal is quantitated
and compared with a standard value. If the amount of signal in the patient sample is significantly 
altered in comparison to a control sample then the presence of altered levels of nucleotide 
sequences encoding RNAAP in the sample indicates the presence of the associated disorder. Such
assays may also be used to evaluate the efficacy of a particular therapeutic treatment regimen in
animal studies, in clinical trials, or to monitor the treatment of an individual patient. 

In order to provide a basis for the diagnosis of a disorder associated with expression of 
RNAAP, a normal or standard profile for expression is established. This may be accomplished by 
combining body fluids or cell extracts taken from normal subjects, either animal or human, with a 
sequence, or a fragment thereof, encoding RNAAP, under conditions suitable for hybridization or
amplification. Standard hybridization may be quantified by comparing the values obtained from
normal subjects with values from an experiment in which a known amount of a substantially
purified polynucleotide is used. Standard values obtained in this manner may be compared with
values obtained from samples from patients who are symptomatic for a disorder. Deviation from
standard values is used to establish the presence of a disorder. 

Once the presence of a disorder is established and a treatment protocol is initiated, 
hybridization assays may be repeated on a regular basis to determine if the level of expression in 
the patient begins to approximate that which is observed in the normal subject. The results
obtained from successive assays may be used to show the efficacy of treatment over a period
ranging from several days to months. 

With respect to cancer, the presence of an abnormal amount of transcript (either under- or 
overexpressed) in biopsied tissue from an individual may indicate a predisposition for the 
development of the disease, or may provide a means for detecting the disease prior to the
appearance of actual clinical symptoms. A more definitive diagnosis of this type may allow health
professionals to employ preventative measures or aggressive treatment earlier thereby preventing 
the development or further progression of the cancer. 

Additional diagnostic uses for oligonucleotides designed from the sequences encoding 
RNAAP may involve the use of PCR. These oligomers may be chemically synthesized, generated
enzymatically, or produced in vitro. Oligomers will preferably contain a fragment of a
polynucleotide encoding RNAAP, or a fragment of a polynucleotide complementary to the
polynucleotide encoding RNAAP, and will be employed under optimized conditions for 
identification of a specific gene or condition. Oligomers may also be employed under less 
stringent conditions for detection or quantitation of closely related DNA or RNA sequences. 

Methods which may also be used to quantify the expression of RNAAP include
radiolabeling or biotinylating nucleotides, coamplification of a control nucleic acid, and
interpolating results from standard curves. (See, e.g., Melby, P.C. et al. (1993) J. Immunol. 
Methods 159:235-244; Duplaa, C. et al. (1993) Anal. Biochem. 212:229-236.) The speed of 
quantitation of multiple samples may be accelerated by running the assay in an ELISA format
where the oligomer of interest is presented in various dilutions and a spectrophotometric or
colorimetric response gives rapid quantitation. 
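
Interpolating results from a standard curve, as mentioned above, can be sketched as follows. This is a minimal illustration that assumes piecewise-linear interpolation between bracketing standards. The function name, the units, and the standard values are hypothetical, and the cited references may use a different curve-fitting approach.

```python
# Illustrative sketch only: standards and units below are made up.
from bisect import bisect_left


def interpolate_concentration(signal: float, standards: list) -> float:
    """Estimate concentration from a signal by linear interpolation.

    `standards` is a list of (concentration, signal) pairs whose signals
    are distinct and increase with concentration.
    """
    if len(standards) < 2:
        raise ValueError("at least two standards are required")
    points = sorted(standards, key=lambda p: p[1])
    signals = [s for _, s in points]
    if not signals[0] <= signal <= signals[-1]:
        raise ValueError("signal outside the range of the standard curve")
    i = bisect_left(signals, signal)
    if signals[i] == signal:
        return float(points[i][0])
    (c0, s0), (c1, s1) = points[i - 1], points[i]
    return c0 + (c1 - c0) * (signal - s0) / (s1 - s0)


# Hypothetical dilution series: (concentration, absorbance).
curve = [(0.0, 0.0), (10.0, 0.2), (20.0, 0.4), (40.0, 0.8)]
estimate = interpolate_concentration(0.3, curve)
```

Signals outside the range of the standards are rejected rather than extrapolated, since the curve is only characterized between the lowest and highest standard.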

In further embodiments, oligonucleotides or longer fragments derived from any of the 
polynucleotide sequences described herein may be used as targets in a microarray. The 
microarray can be used to monitor the expression level of large numbers of genes simultaneously 
and to identify genetic variants, mutations, and polymorphisms. This information may be used to
determine gene function, to understand the genetic basis of a disorder, to diagnose a disorder, and 
to develop and monitor the activities of therapeutic agents. 

Microarrays may be prepared, used, and analyzed using methods known in the art. (See,
e.g., Brennan, T.M. et al. (1995) U.S. Patent No. 5,474,796; Schena, M. et al. (1996) Proc. Natl.
Acad. Sci. 93:10614-10619; Baldeschweiler et al. (1995) PCT application WO95/251116; Shalon,
D. et al. (1995) PCT application WO95/35505; Heller, R.A. et al. (1997) Proc. Natl. Acad. Sci.
94:2150-2155; and Heller, M.J. et al. (1997) U.S. Patent No. 5,605,662.)

In another embodiment of the invention, nucleic acid sequences encoding RNAAP may be 
used to generate hybridization probes useful in mapping the naturally occurring genomic
sequence. The sequences may be mapped to a particular chromosome, to a specific region of a
chromosome, or to artificial chromosome constructions, e.g., human artificial chromosomes
(HACs), yeast artificial chromosomes (YACs), bacterial artificial chromosomes (BACs), bacterial
P1 constructions, or single chromosome cDNA libraries. (See, e.g., Harrington, J.J. et al. (1997)
Nat. Genet. 15:345-355; Price, C.M. (1993) Blood Rev. 7:127-134; and Trask, B.J. (1991) Trends
Genet. 7:149-154.) 

Fluorescent in situ hybridization (FISH) may be correlated with other physical 
chromosome mapping techniques and genetic map data. (See, e.g., Heinz-Ulrich, et al. (1995) in 
Meyers, supra, pp. 965-968.) Examples of genetic map data can be found in various scientific
journals or at the Online Mendelian Inheritance in Man (OMIM) site. Correlation between the
location of the gene encoding RNAAP on a physical chromosomal map and a specific disorder, or
a predisposition to a specific disorder, may help define the region of DNA associated with that 
disorder. The nucleotide sequences of the invention may be used to detect differences in gene 
sequences among normal, carrier, and affected individuals. 

In situ hybridization of chromosomal preparations and physical mapping techniques, such
as linkage analysis using established chromosomal markers, may be used for extending genetic
maps. Often the placement of a gene on the chromosome of another mammalian species, such as 
mouse, may reveal associated markers even if the number or arm of a particular human 
chromosome is not known. New sequences can be assigned to chromosomal arms by physical
mapping. This provides valuable information to investigators searching for disease genes using
positional cloning or other gene discovery techniques. Once the disease or syndrome has been
crudely localized by genetic linkage to a particular genomic region, e.g., ataxia-telangiectasia to
11q22-23, any sequences mapping to that area may represent associated or regulatory genes for
further investigation. (See, e.g., Gatti, R.A. et al. (1988) Nature 336:577-580.) The nucleotide
sequence of the subject invention may also be used to detect differences in the chromosomal
location due to translocation, inversion, etc., among normal, carrier, or affected individuals. 

In another embodiment of the invention, RNAAP, its catalytic or immunogenic fragments, 
or oligopeptides thereof can be used for screening libraries of compounds in any of a variety of 
drug screening techniques. The fragment employed in such screening may be free in solution,
affixed to a solid support, borne on a cell surface, or located intracellularly. The formation of
binding complexes between RNAAP and the agent being tested may be measured. 

Another technique for drug screening provides for high throughput screening of 
compounds having suitable binding affinity to the protein of interest. (See, e.g., Geysen, et al.
(1984) PCT application WO84/03564.) In this method, large numbers of different small test
compounds are synthesized on a solid substrate. The test compounds are reacted with RNAAP, or
fragments thereof, and washed. Bound RNAAP is then detected by methods well known in the 
art. Purified RNAAP can also be coated directly onto plates for use in the aforementioned drug 
screening techniques. Alternatively, non-neutralizing antibodies can be used to capture the
peptide and immobilize it on a solid support.

In another embodiment, one may use competitive drug screening assays in which 
neutralizing antibodies capable of binding RNAAP specifically compete with a test compound for 
binding RNAAP. In this manner, antibodies can be used to detect the presence of any peptide 
which shares one or more antigenic determinants with RNAAP. 

In additional embodiments, the nucleotide sequences which encode RNAAP may be used
in any molecular biology techniques that have yet to be developed, provided the new techniques
rely on properties of nucleotide sequences that are currently known, including, but not limited to, 
such properties as the triplet genetic code and specific base pair interactions. 

Without further elaboration, it is believed that one skilled in the art can, using the
preceding description, utilize the present invention to its fullest extent. The following preferred
specific embodiments are, therefore, to be construed as merely illustrative, and not limitative of 
the remainder of the disclosure in any way whatsoever. 

The disclosures of all patents, applications, and publications mentioned above and below, 
in particular U.S. Ser. No. 60/115,639 and U.S. Ser. No. 60/097,550, are hereby expressly
incorporated by reference.

EXAMPLES 

I. Construction of cDNA Libraries 

RNA was purchased from Clontech or isolated from tissues described in Table 4. Some 
tissues were homogenized and lysed in guanidinium isothiocyanate, while others were
homogenized and lysed in phenol or in a suitable mixture of denaturants, such as TRIZOL (Life
Technologies), a monophasic solution of phenol and guanidine isothiocyanate. The resulting 
lysates were centrifuged over CsCl cushions or extracted with chloroform. RNA was precipitated 
from the lysates with either isopropanol or sodium acetate and ethanol, or by other routine
methods.

Phenol extraction and precipitation of RNA were repeated as necessary to increase RNA 
purity. In some cases, RNA was treated with DNase. For most libraries, poly(A+) RNA was 
isolated using oligo d(T)-coupled paramagnetic particles (Promega), OLIGOTEX latex particles
(QIAGEN, Chatsworth CA), or an OLIGOTEX mRNA purification kit (QIAGEN). Alternatively,
RNA was isolated directly from tissue lysates using other RNA isolation kits, e.g., the
POLY(A)PURE mRNA purification kit (Ambion, Austin TX). 

In some cases, Stratagene was provided with RNA and constructed the corresponding 
cDNA libraries. Otherwise, cDNA was synthesized and cDNA libraries were constructed with the 
UNIZAP vector system (Stratagene) or SUPERSCRIPT plasmid system (Life Technologies),
using the recommended procedures or similar methods known in the art. (See, e.g., Ausubel,
1997, supra, units 5.1-6.6.) Reverse transcription was initiated using oligo d(T) or random
primers. Synthetic oligonucleotide adapters were ligated to double stranded cDNA, and the cDNA
was digested with the appropriate restriction enzyme or enzymes. For most libraries, the cDNA
was size-selected (300-1000 bp) using SEPHACRYL S1000, SEPHAROSE CL2B, or
SEPHAROSE CL4B column chromatography (Amersham Pharmacia Biotech) or preparative
agarose gel electrophoresis. cDNAs were ligated into compatible restriction enzyme sites of the
polylinker of a suitable plasmid, e.g., PBLUESCRIPT plasmid (Stratagene), PSPORT1 plasmid
(Life Technologies), or pINCY (Incyte Pharmaceuticals, Palo Alto CA). Recombinant plasmids
were transformed into competent E. coli cells including XL1-BLUE, XL1-BLUEMRF, or SOLR
from Stratagene or DH5α, DH10B, or ELECTROMAX DH10B from Life Technologies.

II. Isolation of cDNA Clones

Plasmids were recovered from host cells by in vivo excision, using the UNIZAP vector 
system (Stratagene) or cell lysis. Plasmids were purified using at least one of the following: a 
Magic or WIZARD Minipreps DNA purification system (Promega); an AGTC Miniprep
purification kit (Edge Biosystems, Gaithersburg MD); and QIAWELL 8 Plasmid, QIAWELL 8
Plus Plasmid, QIAWELL 8 Ultra Plasmid purification systems or the R.E.A.L. PREP 96 plasmid
purification kit from QIAGEN. Following precipitation, plasmids were resuspended in 0.1 ml of
distilled water and stored, with or without lyophilization, at 4°C.

Alternatively, plasmid DNA was amplified from host cell lysates using direct link PCR in
a high-throughput format (Rao, V.B. (1994) Anal. Biochem. 216:1-14). Host cell lysis and
thermal cycling steps were carried out in a single reaction mixture. Samples were processed and
stored in 384-well plates, and the concentration of amplified plasmid DNA was quantified
fluorometrically using PICOGREEN dye (Molecular Probes, Eugene OR) and a Fluoroskan II
fluorescence scanner (Labsystems Oy, Helsinki, Finland).

III. Sequencing and Analysis

cDNA sequencing reactions were processed using standard methods or high-throughput 
instrumentation such as the ABI CATALYST 800 (Perkin-Elmer) thermal cycler or the PTC-200
thermal cycler (MJ Research) in conjunction with the HYDRA microdispenser (Robbins
Scientific) or the MICROLAB 2200 (Hamilton) liquid transfer system. cDNA sequencing
reactions were prepared using reagents provided by Amersham Pharmacia Biotech or supplied in
ABI sequencing kits such as the ABI PRISM BIGDYE Terminator cycle sequencing ready
reaction kit (Perkin-Elmer). Electrophoretic separation of cDNA sequencing reactions and
detection of labeled polynucleotides were carried out using the MEGABACE 1000 DNA
sequencing system (Molecular Dynamics); the ABI PRISM 373 or 377 sequencing systems
(Perkin-Elmer) in conjunction with standard ABI protocols and base calling software; or other 
sequence analysis systems known in the art. Reading frames within the cDNA sequences were 
identified using standard methods (reviewed in Ausubel, 1997, supra , unit 7.7). Some of the 
cDNA sequences were selected for extension using the techniques disclosed in Example V. 

The polynucleotide sequences derived from cDNA sequencing were assembled and
analyzed using a combination of software programs which utilize algorithms well known to those
skilled in the art. Table 5 summarizes the tools, programs, and algorithms used and provides
applicable descriptions, references, and threshold parameters. The first column of Table 5 shows
the tools, programs, and algorithms used, the second column provides brief descriptions thereof,
the third column presents appropriate references, all of which are incorporated by reference herein
in their entirety, and the fourth column presents, where applicable, the scores, probability values,
and other parameters used to evaluate the strength of a match between two sequences (the higher
the score, the greater the homology between two sequences). Sequences were analyzed using
MACDNASIS PRO software (Hitachi Software Engineering, South San Francisco CA) and
LASERGENE software (DNASTAR).

The polynucleotide sequences were validated by removing vector, linker, and polyA 
sequences and by masking ambiguous bases, using algorithms and programs based on BLAST, 
dynamic programming, and dinucleotide nearest neighbor analysis. The sequences were then
queried against a selection of public databases such as the GenBank primate, rodent, mammalian,
vertebrate, and eukaryote databases, and BLOCKS to acquire annotation using programs based on
BLAST, FASTA, and BLIMPS. The sequences were assembled into full length polynucleotide 
sequences using programs based on Phred, Phrap, and Consed, and were screened for open 
reading frames using programs based on GeneMark, BLAST, and FASTA. The full length 
polynucleotide sequences were translated to derive the corresponding full length amino acid
sequences, and these full length sequences were subsequently analyzed by querying against 
databases such as the GenBank databases (described above), SwissProt, BLOCKS, PRINTS, 
Prosite, and Hidden Markov Model (HMM)-based protein family databases such as PFAM. 
HMM is a probabilistic approach which analyzes consensus primary structures of gene families.
(See, e.g., Eddy, S.R. (1996) Curr. Opin. Str. Biol. 6:361-365.)
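
The screening for open reading frames and the translation step described above can be illustrated with a minimal Python sketch. This is not the software used in these Examples (those programs are summarized in Table 5). The function names `translate` and `longest_orf`, the restriction to the forward strand, and the use of the standard genetic code are assumptions made purely for illustration.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order (assumption:
# the standard nuclear code applies to the cDNA being translated).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate from the first base until a stop codon or the sequence end."""
    dna = dna.upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        # Codons containing masked or ambiguous bases are reported as X.
        aa = CODON_TABLE.get(dna[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def longest_orf(dna):
    """Return the longest ATG-initiated translation on the forward strand.

    Every ATG is tried, so all three forward reading frames are covered.
    A frame that runs off the end without a stop codon is still reported.
    """
    dna = dna.upper()
    best = ""
    for start in range(len(dna) - 2):
        if dna[start:start + 3] == "ATG":
            candidate = translate(dna[start:])
            if len(candidate) > len(best):
                best = candidate
    return best
```

For example, `translate("ATGTCTCGTTAA")` yields the one-letter peptide `MSR` (Met Ser Arg). A production pipeline would also examine the reverse complement and apply a minimum length threshold.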

The programs described above for the assembly and analysis of full length polynucleotide 
and amino acid sequences were also used to identify polynucleotide sequence fragments from 
SEQ ID NO:26-50. Fragments from about 20 to about 4000 nucleotides which are useful in 
hybridization and amplification technologies were described in The Invention section above.

IV. Northern Analysis

Northern analysis is a laboratory technique used to detect the presence of a transcript of a 
gene and involves the hybridization of a labeled nucleotide sequence to a membrane on which 
RNAs from a particular cell type or tissue have been bound. (See, e.g., Sambrook, supra , ch. 7; 
Ausubel, 1995, supra , ch. 4 and 16.) 

Analogous computer techniques applying BLAST were used to search for identical or
related molecules in nucleotide databases such as GenBank or LIFESEQ (Incyte Pharmaceuticals).
This analysis is much faster than multiple membrane-based hybridizations. In addition, the 
sensitivity of the computer search can be modified to determine whether any particular match is 
categorized as exact or similar. The basis of the search is the product score, which is defined as: 

(% sequence identity x % maximum BLAST score) / 100

The product score takes into account both the degree of similarity between two sequences and the 
length of the sequence match. For example, with a product score of 40, the match will be exact 
within a 1% to 2% error, and, with a product score of 70, the match will be exact. Similar 
molecules are usually identified by selecting those which show product scores between 15 and 40,
although lower scores may identify related molecules. 
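
As a worked illustration of the product score, the following Python sketch computes the score and sorts it into the ranges mentioned above. The function names and the category labels are hypothetical, and the treatment of the boundary values (for example, assigning a score of exactly 40 to the higher category) is an assumption, since the text gives only representative scores.

```python
def product_score(percent_identity, percent_max_blast_score):
    """Product score = (% sequence identity x % maximum BLAST score) / 100."""
    return percent_identity * percent_max_blast_score / 100.0


def categorize(score):
    """Map a product score to a descriptive label (boundaries are assumed)."""
    if score >= 70:
        return "exact"
    if score >= 40:
        return "exact within 1% to 2% error"
    if score >= 15:
        return "similar"
    return "possibly related"


# Hypothetical match: 80% identity at 50% of the maximum BLAST score.
example_score = product_score(80, 50)
example_label = categorize(example_score)
```

Here `example_score` is 40.0, which under the assumed boundaries falls in the "exact within 1% to 2% error" range.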

The results of northern analyses are reported as a percentage distribution of libraries in 
which the transcript encoding RNAAP occurred. Analysis involved the categorization of cDNA 
libraries by organ/tissue and disease. The organ/tissue categories included cardiovascular,
dermatologic, developmental, endocrine, gastrointestinal, hematopoietic/immune, musculoskeletal,
nervous, reproductive, and urologic. The disease/condition categories included cancer,
inflammation/trauma, cell proliferation, neurological, and pooled. For each category, the number
of libraries expressing the sequence of interest was counted and divided by the total number of
libraries across all categories. Percentage values of tissue-specific and disease- or condition-
specific expression are reported in Table 3.
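
The percentage calculation can be sketched in a few lines of Python. The sketch assumes that the denominator is the total number of libraries expressing the sequence across all categories, so that the percentages sum to 100; the function name and the example data are hypothetical and are not taken from Table 3.

```python
from collections import Counter


def percentage_distribution(library_categories):
    """Return {category: percent of expressing libraries in that category}.

    `library_categories` holds one category label per library in which the
    transcript of interest was found.
    """
    counts = Counter(library_categories)
    total = sum(counts.values())
    if total == 0:
        return {}
    return {category: 100.0 * n / total for category, n in counts.items()}


# Hypothetical input: four expressing libraries in three tissue categories.
example = ["nervous", "nervous", "reproductive", "cardiovascular"]
distribution = percentage_distribution(example)
```

With this input, half of the expressing libraries fall in the nervous category and a quarter in each of the other two.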
V. Extension of RNAAP Encoding Polynucleotides 

The full length nucleic acid sequences of SEQ ID NO:26-50 were produced by extension 
of an appropriate fragment of the full length molecule using oligonucleotide primers designed
from this fragment. One primer was synthesized to initiate 5' extension of the known fragment,
and the other primer, to initiate 3' extension of the known fragment. The initial primers were
designed using OLIGO 4.06 software (National Biosciences), or another appropriate program, to
be about 22 to 30 nucleotides in length, to have a GC content of about 50% or more, and to anneal
to the target sequence at temperatures of about 68°C to about 72°C. Any stretch of nucleotides
which would result in hairpin structures and primer-primer dimerizations was avoided.
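
Two of the primer criteria above, length and GC content, are simple enough to check programmatically. The Python sketch below is an illustration only and does not reproduce OLIGO 4.06; the function names are hypothetical, the word "about" in the text is treated as a hard cutoff for simplicity, and annealing temperature, hairpin, and primer-dimer checks are deliberately omitted because they depend on a thermodynamic model not specified here.

```python
def gc_percent(primer):
    """Return the percentage of G and C bases in a primer."""
    if not primer:
        raise ValueError("primer must not be empty")
    primer = primer.upper()
    return 100.0 * (primer.count("G") + primer.count("C")) / len(primer)


def meets_basic_criteria(primer, min_len=22, max_len=30, min_gc=50.0):
    """Check length (22 to 30 nt) and GC content (50% or more) only."""
    return min_len <= len(primer) <= max_len and gc_percent(primer) >= min_gc


# Hypothetical candidates, not sequences from SEQ ID NO:26-50.
candidates = ["GC" * 12, "AT" * 12, "GCGC"]
accepted = [p for p in candidates if meets_basic_criteria(p)]
```

Only the first candidate passes: the second has no G or C bases and the third is too short.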

Selected human cDNA libraries were used to extend the sequence. If more than one 
extension was necessary or desired, additional or nested sets of primers were designed. 

High fidelity amplification was obtained by PCR using methods well known in the art.
PCR was performed in 96-well plates using the PTC-200 thermal cycler (MJ Research, Inc.). The
reaction mix contained DNA template, 200 nmol of each primer, reaction buffer containing Mg2+,
(NH4)2SO4, and β-mercaptoethanol, Taq DNA polymerase (Amersham Pharmacia Biotech),
ELONGASE enzyme (Life Technologies), and Pfu DNA polymerase (Stratagene), with the
following parameters for primer pair PCI A and PCI B: Step 1: 94°C, 3 min; Step 2: 94°C, 15 sec;
Step 3: 60°C, 1 min; Step 4: 68°C, 2 min; Step 5: Steps 2, 3, and 4 repeated 20 times; Step 6:
68°C, 5 min; Step 7: storage at 4°C. In the alternative, the parameters for primer pair T7 and SK+
were as follows: Step 1: 94°C, 3 min; Step 2: 94°C, 15 sec; Step 3: 57°C, 1 min; Step 4: 68°C, 2
min; Step 5: Steps 2, 3, and 4 repeated 20 times; Step 6: 68°C, 5 min; Step 7: storage at 4°C.
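
The cycling parameters for primer pair PCI A and PCI B can be written down as a small data structure, which makes the programmed hold time easy to total. This Python sketch is illustrative: the names `Hold`, `programmed_seconds`, and `PCI_PROGRAM` are hypothetical, ramp times between temperatures are ignored, and reading "repeated 20 times" as one initial pass plus 20 repeats is an assumption (the `first_pass` flag gives the other reading).

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Hold:
    temp_c: float  # block temperature in degrees Celsius
    seconds: int   # hold time at that temperature


def programmed_seconds(initial, cycle, repeats, final, first_pass=True):
    """Total programmed hold time, excluding ramping and the 4°C storage step."""
    passes = repeats + (1 if first_pass else 0)
    cycle_seconds = sum(hold.seconds for hold in cycle)
    return initial.seconds + passes * cycle_seconds + final.seconds


# Steps 1 through 6 for primer pair PCI A and PCI B as stated above.
PCI_PROGRAM = dict(
    initial=Hold(94, 180),                                # Step 1
    cycle=(Hold(94, 15), Hold(60, 60), Hold(68, 120)),    # Steps 2 to 4
    repeats=20,                                           # Step 5
    final=Hold(68, 300),                                  # Step 6
)

total = programmed_seconds(**PCI_PROGRAM)
```

Under the stated assumption the program holds for 4575 seconds, a little over 76 minutes. The T7 and SK+ program differs only in the Step 3 temperature (57°C), so its programmed time is the same.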

The concentration of DNA in each well was determined by dispensing 100 µl
PICOGREEN quantitation reagent (0.25% (v/v) PICOGREEN; Molecular Probes, Eugene OR)
dissolved in 1X TE and 0.5 µl of undiluted PCR product into each well of an opaque fluorimeter
plate (Corning Costar, Acton MA), allowing the DNA to bind to the reagent. The plate was
scanned in a Fluoroskan II (Labsystems Oy, Helsinki, Finland) to measure the fluorescence of the
sample and to quantify the concentration of DNA. A 5 µl to 10 µl aliquot of the reaction mixture
was analyzed by electrophoresis on a 1% agarose mini-gel to determine which reactions were
successful in extending the sequence.

The extended nucleotides were desalted and concentrated, transferred to 384-well plates,
digested with CviJI cholera virus endonuclease (Molecular Biology Research, Madison WI), and
sonicated or sheared prior to religation into pUC 18 vector (Amersham Pharmacia Biotech). For
shotgun sequencing, the digested nucleotides were separated on low concentration (0.6 to 0.8%)
agarose gels, fragments were excised, and agar digested with Agar ACE (Promega). Extended
clones were religated using T4 ligase (New England Biolabs, Beverly MA) into pUC 18 vector
(Amersham Pharmacia Biotech), treated with Pfu DNA polymerase (Stratagene) to fill-in
restriction site overhangs, and transfected into competent E. coli cells. Transformed cells were
selected on antibiotic-containing media, and individual colonies were picked and cultured
overnight at 37°C in 384-well plates in LB/2x carb liquid media.

The cells were lysed, and DNA was amplified by PCR using Taq DNA polymerase
(Amersham Pharmacia Biotech) and Pfu DNA polymerase (Stratagene) with the following
parameters: Step 1: 94°C, 3 min; Step 2: 94°C, 15 sec; Step 3: 60°C, 1 min; Step 4: 72°C, 2 min;
Step 5: steps 2, 3, and 4 repeated 29 times; Step 6: 72°C, 5 min; Step 7: storage at 4°C. DNA was
quantified by PICOGREEN reagent (Molecular Probes) as described above. Samples with low
DNA recoveries were reamplified using the same conditions as described above. Samples were
diluted with 20% dimethylsulphoxide (1:2, v/v), and sequenced using DYENAMIC energy transfer
sequencing primers and the DYENAMIC DIRECT kit (Amersham Pharmacia Biotech) or the ABI
PRISM BIGDYE Terminator cycle sequencing ready reaction kit (Perkin-Elmer).

In like manner, the nucleotide sequences of SEQ ID NO:26-50 are used to obtain 5' 
regulatory sequences using the procedure above, oligonucleotides designed for such extension, 
and an appropriate genomic library. 

VI. Labeling and Use of Individual Hybridization Probes 

Hybridization probes derived from SEQ ID NO:26-50 are employed to screen cDNAs,
genomic DNAs, or mRNAs. Although the labeling of oligonucleotides, consisting of about 20
base pairs, is specifically described, essentially the same procedure is used with larger nucleotide
fragments. Oligonucleotides are designed using state-of-the-art software such as OLIGO 4.06
software (National Biosciences) and labeled by combining 50 pmol of each oligomer, 250 µCi of
[γ-32P] adenosine triphosphate (Amersham Pharmacia Biotech), and T4 polynucleotide kinase
(DuPont NEN, Boston MA). The labeled oligonucleotides are substantially purified using a
SEPHADEX G-25 superfine size exclusion dextran bead column (Amersham Pharmacia Biotech).
An aliquot containing 10^7 counts per minute of the labeled probe is used in a typical membrane-
based hybridization analysis of human genomic DNA digested with one of the following
endonucleases: Ase I, Bgl II, Eco RI, Pst I, Xba I, or Pvu II (DuPont NEN).

The DNA from each digest is fractionated on a 0.7% agarose gel and transferred to nylon
membranes (Nytran Plus, Schleicher & Schuell, Durham NH). Hybridization is carried out for 16
hours at 40°C. To remove nonspecific signals, blots are sequentially washed at room temperature
under increasingly stringent conditions up to 0.1x saline sodium citrate and 0.5% sodium dodecyl
sulfate. Hybridization patterns are visualized using autoradiography and compared.

VII. Microarrays

A chemical coupling procedure and an ink jet device can be used to synthesize array
elements on the surface of a substrate. (See, e.g., Baldeschweiler, supra.) An array analogous to a
dot or slot blot may also be used to arrange and link elements to the surface of a substrate using
thermal, UV, chemical, or mechanical bonding procedures. A typical array may be produced by
hand or using available methods and machines and contain any appropriate number of elements.
After hybridization, nonhybridized probes are removed and a scanner used to determine the levels
and patterns of fluorescence. The degree of complementarity and the relative abundance of each
probe which hybridizes to an element on the microarray may be assessed through analysis of the
scanned images.

Full-length cDNAs, Expressed Sequence Tags (ESTs), or fragments thereof may 
comprise the elements of the microarray. Fragments suitable for hybridization can be selected 
using software well known in the art such as LASERGENE software (DNASTAR). Full-length
cDNAs, ESTs, or fragments thereof corresponding to one of the nucleotide sequences of the
present invention, or selected at random from a cDNA library relevant to the present invention, are
arranged on an appropriate substrate, e.g., a glass slide. The cDNA is fixed to the slide using, e.g.,
UV cross-linking followed by thermal and chemical treatments and subsequent drying. (See, e.g.,
Schena, M. et al. (1995) Science 270:467-470; Shalon, D. et al. (1996) Genome Res. 6:639-645.)
Fluorescent probes are prepared and used for hybridization to the elements on the substrate. The
substrate is analyzed by procedures described above.

VIII. Complementary Polynucleotides

Sequences complementary to the RNAAP-encoding sequences, or any parts thereof, are 
used to detect, decrease, or inhibit expression of naturally occurring RNAAP. Although use of
oligonucleotides comprising from about 15 to 30 base pairs is described, essentially the same
procedure is used with smaller or with larger sequence fragments. Appropriate oligonucleotides
are designed using OLIGO 4.06 software (National Biosciences) and the coding sequence of
RNAAP. To inhibit transcription, a complementary oligonucleotide is designed from the most
unique 5' sequence and used to prevent promoter binding to the coding sequence. To inhibit
translation, a complementary oligonucleotide is designed to prevent ribosomal binding to the
RNAAP-encoding transcript.
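
Deriving a complementary oligonucleotide from a coding sequence is a reverse-complement operation, sketched below in Python. The function names and the example sequence are hypothetical; choosing which region of the RNAAP coding sequence to target (for example, the most unique 5' sequence) is left to the design software and is not modeled here.

```python
# Translation table for Watson-Crick complements of DNA bases.
COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(dna):
    """Return the reverse complement of a DNA sequence, written 5' to 3'.

    Characters other than A, C, G, and T are passed through unchanged.
    """
    return dna.translate(COMPLEMENT)[::-1]


def antisense_oligo(coding_sequence, start, length=20):
    """Return an oligonucleotide complementary to a window of the coding strand."""
    target = coding_sequence[start:start + length]
    if len(target) < length:
        raise ValueError("window extends past the end of the sequence")
    return reverse_complement(target)


# Hypothetical 12-nt coding fragment; a 9-nt oligo is used for brevity.
oligo = antisense_oligo("ATGTCTCGTTAC", start=0, length=9)
```

Here `oligo` is `ACGAGACAT`, the reverse complement of the first nine bases. In practice the length would fall in the range of about 15 to 30 bases described above.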

IX. Expression of RNAAP 

Expression and purification of RNAAP are achieved using bacterial or virus-based
expression systems. For expression of RNAAP in bacteria, cDNA is subcloned into an
appropriate vector containing an antibiotic resistance gene and an inducible promoter that directs
high levels of cDNA transcription. Examples of such promoters include, but are not limited to, the
trp-lac (tac) hybrid promoter and the T5 or T7 bacteriophage promoter in conjunction with the lac
operator regulatory element. Recombinant vectors are transformed into suitable bacterial hosts,
e.g., BL21(DE3). Antibiotic resistant bacteria express RNAAP upon induction with isopropyl
beta-D-thiogalactopyranoside (IPTG). Expression of RNAAP in eukaryotic cells is achieved by
infecting insect or mammalian cell lines with recombinant Autographa californica nuclear
polyhedrosis virus (AcMNPV), commonly known as baculovirus. The nonessential polyhedrin
gene of baculovirus is replaced with cDNA encoding RNAAP by either homologous
recombination or bacterial-mediated transposition involving transfer plasmid intermediates. Viral
infectivity is maintained and the strong polyhedrin promoter drives high levels of cDNA
transcription. Recombinant baculovirus is used to infect Spodoptera frugiperda (Sf9) insect cells
in most cases, or human hepatocytes, in some cases. Infection of the latter requires additional
genetic modifications to baculovirus. (See Engelhard, E. K. et al. (1994) Proc. Natl. Acad. Sci.
USA 91:3224-3227; Sandig, V. et al. (1996) Hum. Gene Ther. 7:1937-1945.)

In most expression systems, RNAAP is synthesized as a fusion protein with, e.g., 
glutathione S-transferase (GST) or a peptide epitope tag, such as FLAG or 6-His, permitting rapid, 
single-step, affinity-based purification of recombinant fusion protein from crude cell lysates. 
GST, a 26-kilodalton enzyme from Schistosoma japonicum, enables the purification of fusion
proteins on immobilized glutathione under conditions that maintain protein activity and
antigenicity (Amersham Pharmacia Biotech). Following purification, the GST moiety can be
proteolytically cleaved from RNAAP at specifically engineered sites. FLAG, an 8-amino acid
peptide, enables immunoaffinity purification using commercially available monoclonal and
polyclonal anti-FLAG antibodies (Eastman Kodak). 6-His, a stretch of six consecutive histidine
residues, enables purification on metal-chelate resins (QIAGEN). Methods for protein expression
and purification are discussed in Ausubel (1995, supra, ch. 10 and 16). Purified RNAAP obtained
by these methods can be used directly in the following activity assay. 

X. Demonstration of RNAAP Activity 

RNAAP activity is demonstrated by the formation of an RNAAP-RNA complex as
detected by a polyacrylamide gel mobility-shift assay. In preparation for this assay, RNAAP is
expressed by transforming a mammalian cell line such as COS7, HeLa or CHO with a eukaryotic
expression vector containing RNAAP cDNA. The cells are incubated for 48-72 hours after
transformation under conditions which allow expression and accumulation of RNAAP. Extracts
containing solubilized proteins can be prepared from cells expressing RNAAP by methods well
known in the art. Portions of the extract containing RNAAP are added to [32P]-labeled RNA.
Radioactive RNA can be synthesized in vitro by techniques well known in the art. The mixtures
are incubated at 25°C in the presence of RNase inhibitors under buffered conditions for 5-10
minutes. After incubation, the samples are analyzed by polyacrylamide gel electrophoresis
followed by autoradiography. The presence of a high molecular weight band on the
autoradiogram indicates the formation of a complex between RNAAP and the radioactive
transcript. A band of significantly lower molecular weight will be present in samples prepared
using control extracts prepared from untransformed cells. The amount of RNAAP-RNA complex
can be quantified using phospho-image analysis and is proportional to the activity of RNAAP.

Alternatively, RNAAP, or biologically active fragments thereof, are labeled with 125I
Bolton-Hunter reagent. (See, e.g., Bolton et al. (1973) Biochem. J. 133:529.) Candidate
molecules previously arrayed in the wells of a multi-well plate are incubated with the labeled
RNAAP, washed, and any wells with labeled RNAAP complex are assayed. Data obtained using
different concentrations of RNAAP are used to calculate values for the number, affinity, and
association of RNAAP with the candidate molecules.



XI. Functional Assays

RNAAP function is assessed by expressing the sequences encoding RNAAP at
physiologically elevated levels in mammalian cell culture systems. cDNA is subcloned into a
mammalian expression vector containing a strong promoter that drives high levels of cDNA
expression. Vectors of choice include pCMV SPORT (Life Technologies) and pCR3.1
(Invitrogen, Carlsbad CA), both of which contain the cytomegalovirus promoter. 5-10 µg of
recombinant vector are transiently transfected into a human cell line, preferably of endothelial or
hematopoietic origin, using either liposome formulations or electroporation. 1-2 µg of an
additional plasmid containing sequences encoding a marker protein are co-transfected. Expression
of a marker protein provides a means to distinguish transfected cells from nontransfected cells and
is a reliable predictor of cDNA expression from the recombinant vector. Marker proteins of
choice include, e.g., Green Fluorescent Protein (GFP; Clontech), CD64, or a CD64-GFP fusion
protein. Flow cytometry (FCM), an automated, laser optics-based technique, is used to identify
transfected cells expressing GFP or CD64-GFP and to evaluate the apoptotic state of the cells and
other cellular properties. FCM detects and quantifies the uptake of fluorescent molecules that
diagnose events preceding or coincident with cell death. These events include changes in nuclear
DNA content as measured by staining of DNA with propidium iodide; changes in cell size and
granularity as measured by forward light scatter and 90 degree side light scatter; down-regulation
of DNA synthesis as measured by decrease in bromodeoxyuridine uptake; alterations in
expression of cell surface and intracellular proteins as measured by reactivity with specific
antibodies; and alterations in plasma membrane composition as measured by the binding of
fluorescein-conjugated Annexin V protein to the cell surface. Methods in flow cytometry are
discussed in Ormerod, M. G. (1994) Flow Cytometry, Oxford, New York NY.

The influence of RNAAP on gene expression can be assessed using highly purified 
populations of cells transfected with sequences encoding RNAAP and either CD64 or CD64-GFP. 
CD64 and CD64-GFP are expressed on the surface of transfected cells and bind to conserved 
regions of human immunoglobulin G (IgG). Transfected cells are efficiently separated from
nontransfected cells using magnetic beads coated with either human IgG or antibody against CD64
(DYNAL, Lake Success NY). mRNA can be purified from the cells using methods well known
by those of skill in the art. Expression of mRNA encoding RNAAP and other genes of interest
can be analyzed by northern analysis or microarray techniques.

XII. Production of RNAAP Specific Antibodies

RNAAP substantially purified using polyacrylamide gel electrophoresis (PAGE; see, e.g.,
Harrington, M.G. (1990) Methods Enzymol. 182:488-495), or other purification techniques, is
used to immunize rabbits and to produce antibodies using standard protocols. 

Alternatively, the RNAAP amino acid sequence is analyzed using LASERGENE 
software (DNASTAR) to determine regions of high immunogenicity, and a corresponding
oligopeptide is synthesized and used to raise antibodies by means known to those of skill in the
art. Methods for selection of appropriate epitopes, such as those near the C-terminus or in
hydrophilic regions, are well described in the art. (See, e.g., Ausubel, 1995, supra, ch. 11.)
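
One simple way to locate a hydrophilic region is a sliding-window average over a hydropathy scale. The Python sketch below uses the Kyte-Doolittle scale and a 15-residue window; it is an assumption-laden illustration and is not the algorithm implemented in LASERGENE. The function name is hypothetical, and hydrophilicity alone is only a rough proxy for immunogenicity.

```python
# Kyte-Doolittle hydropathy values; more negative means more hydrophilic.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def most_hydrophilic_window(protein, window=15):
    """Return (start index, peptide, mean hydropathy) of the most hydrophilic window.

    `protein` is a one-letter amino acid sequence containing only the 20
    standard residues; any other character raises KeyError.
    """
    protein = protein.upper()
    if len(protein) < window:
        raise ValueError("sequence is shorter than the window")
    best_start, best_mean = 0, float("inf")
    for start in range(len(protein) - window + 1):
        segment = protein[start:start + window]
        mean = sum(KYTE_DOOLITTLE[aa] for aa in segment) / window
        if mean < best_mean:
            best_start, best_mean = start, mean
    return best_start, protein[best_start:best_start + window], best_mean


# Hypothetical test sequence: a lysine-rich stretch flanked by leucines.
start, peptide, mean = most_hydrophilic_window("L" * 15 + "K" * 15 + "L" * 15)
```

For this artificial sequence the function selects the fifteen lysines beginning at index 15. A real selection would also weigh surface accessibility, proximity to the C-terminus, and uniqueness of the peptide.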

Typically, oligopeptides 15 residues in length are synthesized using an ABI 431A peptide
synthesizer (Perkin-Elmer) using fmoc-chemistry and coupled to KLH (Sigma-Aldrich, St. Louis
MO) by reaction with N-maleimidobenzoyl-N-hydroxysuccinimide ester (MBS) to increase
immunogenicity. (See, e.g., Ausubel, 1995, supra.) Rabbits are immunized with the oligopeptide-
KLH complex in complete Freund's adjuvant. Resulting antisera are tested for antipeptide activity
by, for example, binding the peptide to plastic, blocking with 1% BSA, reacting with rabbit
antisera, washing, and reacting with radio-iodinated goat anti-rabbit IgG.

XIII. Purification of Naturally Occurring RNAAP Using Specific Antibodies

Naturally occurring or recombinant RNAAP is substantially purified by immunoaffinity 
chromatography using antibodies specific for RNAAP. An immunoaffinity column is constructed 
by covalently coupling anti-RNAAP antibody to an activated chromatographic resin, such as 
CNBr-activated SEPHAROSE (Amersham Pharmacia Biotech). After the coupling, the resin is
blocked and washed according to the manufacturer's instructions. 

Media containing RNAAP are passed over the immunoaffinity column, and the column is 
washed under conditions that allow the preferential absorbance of RNAAP (e.g., high ionic 
strength buffers in the presence of detergent). The column is eluted under conditions that disrupt 
antibody/RNAAP binding (e.g., a buffer of pH 2 to pH 3, or a high concentration of a chaotrope,
such as urea or thiocyanate ion), and RNAAP is collected.

XIV. Identification of Molecules Which Interact with RNAAP

RNAAP, or biologically active fragments thereof, are labeled with 125I Bolton-Hunter
reagent. (See, e.g., Bolton et al. (1973) Biochem. J. 133:529.) Candidate molecules previously
arrayed in the wells of a multi-well plate are incubated with the labeled RNAAP, washed, and any
wells with labeled RNAAP complex are assayed. Data obtained using different concentrations of 
RNAAP are used to calculate values for the number, affinity, and association of RNAAP with the 
candidate molecules. 

Various modifications and variations of the described methods and systems of the
invention will be apparent to those skilled in the art without departing from the scope and spirit of
the invention. Although the invention has been described in connection with specific preferred
embodiments, it should be understood that the invention as claimed should not be unduly limited
to such specific embodiments. Indeed, various modifications of the described modes for carrying
out the invention which are obvious to those skilled in molecular biology or related fields are
intended to be within the scope of the following claims.

What is claimed is:






1. A substantially purified polypeptide comprising an amino acid sequence selected
from the group consisting of SEQ ID NO:1-25 and fragments thereof.

2. A substantially purified variant having at least 90% amino acid sequence identity
to the amino acid sequence of claim 1.

3. An isolated and purified polynucleotide encoding the polypeptide of claim 1. 

4. An isolated and purified polynucleotide variant having at least 70% 
polynucleotide sequence identity to the polynucleotide of claim 3. 

5. An isolated and purified polynucleotide which hybridizes under stringent 
conditions to the polynucleotide of claim 3. 

6. An isolated and purified polynucleotide having a sequence which is 
complementary to the polynucleotide of claim 3. 

7. A method for detecting a polynucleotide, the method comprising the steps of: 

(a) hybridizing the polynucleotide of claim 6 to at least one nucleic acid in a 
sample, thereby forming a hybridization complex; and 

(b) detecting the hybridization complex, wherein the presence of the 
hybridization complex correlates with the presence of the polynucleotide in the sample. 



8. The method of claim 7 further comprising amplifying the polynucleotide prior to
hybridization.

9. An isolated and purified polynucleotide comprising a polynucleotide sequence 
selected from the group consisting of SEQ ID NO:26-50 and fragments thereof. 

10. An isolated and purified polynucleotide variant having at least 70%
polynucleotide sequence identity to the polynucleotide of claim 9.

11. An isolated and purified polynucleotide having a sequence which is
complementary to the polynucleotide of claim 9.

12. An expression vector comprising at least a fragment of the polynucleotide of
claim 3.



13. A host cell comprising the expression vector of claim 12. 

14. A method for producing a polypeptide, the method comprising the steps of:

a) culturing the host cell of claim 13 under conditions suitable for the
expression of the polypeptide; and

b) recovering the polypeptide from the host cell culture.

15. A pharmaceutical composition comprising the polypeptide of claim 1 in
conjunction with a suitable pharmaceutical carrier.

16. A purified antibody which specifically binds to the polypeptide of claim 1. 

17. A purified agonist of the polypeptide of claim 1.

18. A purified antagonist of the polypeptide of claim 1.

19. A method for treating or preventing a disorder associated with decreased
expression or activity of RNAAP, the method comprising administering to a subject in need of
such treatment an effective amount of the pharmaceutical composition of claim 15.

20. A method for treating or preventing a disorder associated with increased 
expression or activity of RNAAP, the method comprising administering to a subject in need of 
such treatment an effective amount of the antagonist of claim 18. 

<120> HUMAN RNA-ASSOCIATED PROTEINS

<130> PF-0579 PCT 

<140> To Be Assigned 
<141> Herewith 

<150> 60/097,550; 60/115,639 
<151> 1998-08-21; 1999-01-12 

<160> 50 

<170> PERL Program 

<210> 1 
<211> 216 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte Identification No.: 399781CD1 
<400> 1 

Met Ser Arg Tyr Leu Arg Pro Pro Asn Thr Ser Leu Phe Val Arg
1 5 10 15

Asn Val Ala Asp Asp Thr Arg Ser Glu Asp Leu Arg Arg Glu Phe
20 25 30

Gly Arg Tyr Gly Pro Ile Val Asp Val Tyr Val Pro Leu Asp Phe
35 40 45

Tyr Thr Arg Arg Pro Arg Gly Phe Ala Tyr Val Gln Phe Glu Asp
50 55 60

Val Arg Asp Ala Glu Asp Ala Leu His Asn Leu Asp Arg Lys Trp
65 70 75

Ile Cys Gly Arg Gln Ile Glu Ile Gln Phe Ala Gln Gly Asp Arg
80 85 90

Lys Thr Pro Asn Gln Met Lys Ala Lys Glu Gly Arg Asn Val Tyr
95 100 105

Ser Ser Ser Arg Tyr Asp Asp Tyr Asp Arg Tyr Arg Arg Ser Arg
110 115 120

Ser Arg Ser Tyr Glu Arg Arg Arg Ser Arg Ser Arg Ser Phe Asp
125 130 135

Tyr Asn Tyr Arg Arg Ser Tyr Ser Pro Arg Asn Ser Arg Pro Thr
140 145 150

Gly Arg Pro Arg Arg Arg Glu Ala Ile Pro Thr Met Ile Asp Gln
155 160 165

Thr Ala Ala Gly Ile Pro Ser Thr Val Leu Leu Thr Thr Leu Gln
170 175 180

Glu Arg Ser Glu Ser Gly Lys Arg Thr Lys Glu Gly Gln Phe Lys
185 190 195

Arg Pro Lys Gly Gly Trp Lys Val Leu Gln Tyr Glu Tyr Cys Thr
200 205 210

Asn Ile Leu Thr Leu Val
215

<210> 2
<211> 962
<212> PRT
<213> Homo sapiens
<220>
<221> misc_feature
<223> Incyte Identification No.: 1806542CD1
<400> 2
Met Asp Glu Gln Ala Leu Leu Gly Leu Asn Pro Asn Ala Asp Ser
1 5 10 15

Asp Phe Arg Gln Arg Ala Leu Ala Tyr Phe Glu Gln Leu Lys Ile
20 25 30

Ser Pro Asp Ala Trp Gln Val Cys Ala Glu Ala Leu Ala Gln Arg
35 40 45

Thr Tyr Ser Asp Asp His Val Lys Phe Phe Cys Phe Gln Val Leu
50 55 60

Glu His Gln Val Lys Tyr Lys Tyr Ser Glu Leu Thr Thr Val Gln
65 70 75

Gln Gln Leu Ile Arg Glu Thr Leu Ile Ser Trp Leu Gln Ala Gln
80 85 90

Met Leu Asn Pro Gln Pro Glu Lys Thr Phe Ile Arg Asn Lys Ala
95 100 105

Ala Gln Val Phe Ala Leu Leu Phe Val Thr Glu Tyr Leu Thr Lys
110 115 120

Trp Pro Lys Phe Phe Phe Asp Ile Leu Ser Val Val Asp Leu Asn
125 130 135

Pro Arg Gly Val Asp Leu Tyr Leu Arg Ile Leu Met Ala Ile Asp
140 145 150

Ser Glu Leu Val Asp Arg Asp Val Val His Thr Ser Glu Glu Ala
155 160 165

Arg Arg Asn Thr Leu Ile Lys Asp Thr Met Arg Glu Gln Cys Ile
170 175 180

Pro Asn Leu Val Glu Ser Trp Tyr Gln Ile Leu Gln Asn Tyr Gln
185 190 195

Phe Thr Asn Ser Glu Val Thr Cys Gln Cys Leu Glu Val Val Gly
200 205 210

Ala Tyr Val Ser Trp Ile Asp Leu Ser Leu Ile Ala Asn Asp Arg
215 220 225

Phe Ile Asn Met Leu Leu Gly His Met Ser Ile Glu Val Leu Arg
230 235 240

Glu Glu Ala Cys Asp Cys Leu Phe Glu Val Val Asn Lys Gly Met
245 250 255

Asp Pro Val Asp Lys Met Lys Leu Val Glu Ser Leu Cys Gln Val
260 265 270

Leu Gln Ser Ala Gly Phe Phe Ser Ile Asp Gln Glu Glu Asp Val
275 280 285

Asp Phe Leu Ala Arg Phe Ser Lys Leu Val Asn Gly Met Gly Gln
290 295 300

Ser Leu Ile Val Ser Trp Ser Lys Leu Ile Lys Asn Gly Asp Ile
305 310 315

Lys Asn Ala Gln Glu Ala Leu Gln Ala Ile Glu Thr Lys Val Ala
320 325 330

Leu Met Leu Gln Leu Leu Ile His Glu Asp Asp Asp Ile Ser Ser
335 340 345

Asn Ile Ile Gly Phe Cys Tyr Asp Tyr Leu His Ile Leu Lys Gln
350 355 360

Leu Thr Val Leu Ser Asp Gln Gln Lys Ala Asn Val Glu Ala Ile
365 370 375

Met Leu Ala Val Met Lys Lys Leu Thr Tyr Asp Glu Glu Tyr Asn
380 385 390

Phe Glu Asn Glu Gly Glu Asp Glu Ala Met Phe Val Glu Tyr Arg
395 400 405

Lys Gln Leu Lys Leu Leu Leu Asp Arg Leu Ala Gln Val Ser Pro
410 415 420

Glu Leu Leu Leu Ala Ser Val Arg Arg Val Phe Ser Ser Thr Leu
425 430 435

Gln Asn Trp Gln Thr Thr Arg Phe Met Glu Val Glu Val Ala Ile
440 445 450

Arg Leu Leu Tyr Met Leu Ala Glu Ala Leu Pro Val Ser His Gly
455 460 465

Ala His Phe Ser Gly Asp Val Ser Lys Ala Ser Ala Leu Gln Asp
470 475 480

Met Met Arg Thr Leu Val Thr Ser Gly Val Ser Ser Tyr Gln His
485 490 495

Thr Ser Val Thr Leu Glu Phe Phe Glu Thr Val Val Arg Tyr Glu
500 505 510

Lys Phe Phe Thr Val Glu Pro Gln His Ile Pro Cys Val Leu Met
515 520 525

Ala Phe Leu Asp His Arg Gly Leu Arg His Ser Ser Ala Lys Val
530 535 540

Arg Ser Arg Thr Ala Tyr Leu Phe Ser Arg Phe Val Lys Ser Leu
545 550 555

Asn Lys Gln Met Asn Pro Phe Ile Glu Asp Ile Leu Asn Arg Ile
560 565 570

Gln Asp Leu Leu Glu Leu Ser Pro Pro Glu Asn Gly His Gln Ser
575 580 585

Leu Leu Ser Ser Asp Asp Gln Leu Phe Ile Tyr Glu Thr Ala Gly
590 595 600

Val Leu Ile Val Asn Ser Glu Tyr Pro Ala Glu Arg Lys Gln Ala
605 610 615

Leu Met Arg Asn Leu Leu Thr Pro Leu Met Glu Lys Phe Lys Ile
620 625 630

Leu Leu Glu Lys Leu Met Leu Ala Gln Asp Glu Glu Arg Gln Ala
635 640 645

Ser Leu Ala Asp Cys Leu Asn His Ala Val Gly Phe Ala Ser Arg
650 655 660

Thr Ser Lys Ala Phe Ser Asn Lys Gln Thr Val Lys Gln Cys Gly
665 670 675

Cys Ser Glu Val Tyr Leu Asp Cys Leu Gln Thr Phe Leu Pro Ala
680 685 690

Leu Ser Cys Pro Leu Gln Lys Asp Ile Leu Arg Ser Gly Val Arg
695 700 705

Thr Phe Leu His Arg Met Ile Ile Cys Leu Glu Glu Glu Val Leu
710 715 720

Pro Phe Ile Pro Ser Ala Ser Glu His Met Leu Lys Asp Cys Glu
725 730 735

Ala Lys Asp Leu Gln Glu Phe Ile Pro Leu Ile Asn Gln Ile Thr
740 745 750

Ala Lys Phe Lys Ile Gln Val Ser Pro Phe Leu Gln Gln Met Phe
755 760 765

Met Pro Leu Leu His Ala Ile Phe Glu Val Leu Leu Arg Pro Ala
770 775 780

Glu Glu Asn Asp Gln Ser Ala Ala Leu Glu Lys Gln Met Leu Arg
785 790 795

Arg Ser Tyr Phe Ala Phe Leu Gln Thr Val Thr Gly Ser Gly Met
800 805 810

Ser Glu Val Ile Ala Asn Gln Gly Ala Glu Asn Val Glu Arg Val
815 820 825

Leu Val Thr Val Ile Gln Gly Ala Val Glu Tyr Pro Asp Pro Ile
830 835 840

Ala Gln Lys Thr Cys Phe Ile Ile Leu Ser Lys Leu Val Glu Leu
845 850 855

Trp Gly Gly Lys Asp Gly Pro Val Gly Phe Ala Asp Phe Val Tyr
860 865 870

Lys His Ile Val Pro Ala Cys Phe Leu Ala Pro Leu Lys Gln Thr
875 880 885

Phe Asp Leu Ala Asp Ala Gln Thr Val Leu Ala Leu Ser Glu Cys
890 895 900

Ala Val Thr Leu Lys Thr Ile His Leu Lys Arg Gly Pro Glu Cys
905 910 915

Val Gln Tyr Leu Gln Gln Glu Tyr Leu Pro Ser Leu Gln Val Ala
920 925 930

Pro Glu Ile Ile Gln Glu Phe Cys Gln Ala Leu Gln Gln Pro Asp
935 940 945

Ala Lys Val Phe Lys Asn Tyr Leu Lys Val Phe Phe Gln Arg Ala
950 955 960

Lys Pro

<210> 3
<211> 285
<212> PRT
<213> Homo sapiens
<220>
<221> misc_feature
<223> Incyte Identification No.: 2263514CD1
<400> 3
Met Asp Trp Val Met Lys His Asn Gly Pro Asn Asp Ala Ser Asp
1 5 10 15

Gly Thr Val Arg Leu Arg Gly Leu Pro Phe Gly Cys Ser Lys Glu
20 25 30

Glu Ile Val Arg Val Leu Ser Arg Tyr Ile Glu Ile Phe Arg Ser
35 40 45

Ser Arg Ser Glu Ile Lys Gly Phe Tyr Asp Pro Pro Arg Arg Leu
50 55 60

Leu Gly Gln Arg Pro Gly Pro Tyr Asp Arg Pro Ile Gly Gly Arg
65 70 75

Gly Gly Tyr Tyr Gly Ala Gly Arg Gly Ser Tyr Gly Gly Phe Asp
80 85 90

Asp Tyr Gly Gly Tyr Asn Asn Tyr Gly Tyr Gly Asn Asp Gly Phe
95 100 105

Asp Asp Arg Met Arg Asp Gly Arg Gly Met Gly Gly His Gly Tyr
110 115 120

Gly Gly Ala Gly Asp Ala Ser Ser Gly Phe His Gly Gly His Phe
125 130 135

Val His Met Arg Gly Leu Pro Phe Arg Ala Thr Glu Asn Ala Ile
140 145 150

Ala Asn Phe Phe Ser Pro Leu Asn Pro Ile Arg Val His Ile Asp
155 160 165

Ile Gly Ala Asp Gly Arg Ala Thr Gly Glu Ala Asp Val Glu Phe
170 175 180

Val Thr His Glu Asp Ala Val Ala Ala Met Ser Lys Asp Lys Asn
185 190 195

Asn Met Gln His Arg Tyr Ile Glu Leu Phe Leu Asn Ser Thr Pro
200 205 210

Gly Gly Gly Ser Gly Met Gly Gly Ser Gly Met Gly Gly Tyr Gly
215 220 225

Arg Asp Gly Met Asp Asn Gln Gly Gly Tyr Gly Ser Val Gly Arg
230 235 240

Met Gly Met Gly Asn Asn Tyr Ser Gly Gly Tyr Gly Thr Pro Asp
245 250 255

Gly Leu Gly Gly Tyr Gly Arg Gly Gly Gly Gly Ser Gly Gly Tyr
260 265 270

Tyr Gly Gln Gly Gly Met Ser Gly Gly Gly Trp Arg Gly Met Tyr
275 280 285

<210> 4
<211> 267
<212> PRT
<213> Homo sapiens
<220>
<221> misc_feature
<223> Incyte Identification No.: 2738270CD1
<400> 4

Met Gly Ala Ala Ala Ala Glu Ala Asp Arg Thr Leu Phe Val Gly
1 5 10 15

Asn Leu Glu Thr Lys Val Thr Glu Glu Leu Leu Phe Glu Leu Phe
20 25 30

His Gln Ala Gly Pro Val Ile Lys Val Lys Ile Pro Lys Asp Lys
35 40 45

Asp Gly Lys Pro Lys Gln Phe Ala Phe Val Asn Phe Lys His Glu
50 55 60

Val Ser Val Pro Tyr Ala Met Asn Leu Leu Asn Gly Ile Lys Leu
65 70 75

Tyr Gly Arg Pro Ile Lys Ile Gln Phe Arg Ser Gly Ser Ser His
80 85 90

Ala Pro Gln Asp Val Ser Leu Ser Tyr Pro Gln His His Val Gly
95 100 105

Asn Ser Ser Pro Thr Ser Thr Ser Pro Ser Ser Arg Tyr Glu Arg
110 115 120

Thr Met Asp Asn Met Thr Ser Ser Ala Gln Ile Ile Gln Arg Ser
125 130 135

Phe Ser Ser Pro Glu Asn Phe Gln Arg Gln Ala Val Met Asn Ser
140 145 150

Ala Leu Arg Gln Met Ser Tyr Gly Gly Lys Phe Gly Ser Ser Pro
155 160 165

Leu Asp Gln Ser Gly Phe Ser Pro Ser Val Gln Ser His Ser His
170 175 180

Ser Phe Asn Gln Ser Ser Ser Ser Gln Trp Arg Gln Gly Thr Pro
185 190 195

Ser Ser Gln Arg Lys Val Arg Met Asn Ser Tyr Pro Tyr Leu Ala
200 205 210

Asp Arg His Tyr Ser Arg Glu Gln Arg Tyr Thr Asp His Gly Ser
215 220 225

Asp His His Tyr Arg Gly Lys Arg Asp Asp Phe Phe Tyr Glu Asp
230 235 240

Arg Asn His Asp Asp Trp Ser His Asp Tyr Asp Asn Arg Arg Asp
245 250 255

Ser Ser Arg Asp Gly Lys Trp Arg Ser Ser Arg His
260 265

<210> 5
<211> 369
<212> PRT
<213> Homo sapiens
<220>
<221> misc_feature
<223> Incyte Identification No.: 2824412CD1
<400> 5

Met Pro Pro Gln Pro Gln Gly Pro Ala Pro Leu Arg Arg Pro Asp
1 5 10 15

Ser Ser Asp Asp Arg Tyr Val Met Thr Lys His Ala Thr Ile Tyr
20 25 30

Pro Thr Glu Glu Glu Leu Gln Ala Val Gln Lys Ile Val Ser Ile
35 40 45

Thr Glu Arg Ala Leu Lys Leu Val Ser Asp Ser Leu Ser Glu His
50 55 60

Glu Lys Asn Lys Asn Lys Glu Gly Asp Asp Lys Lys Glu Gly Gly
65 70 75

Lys Asp Arg Ala Leu Lys Gly Val Leu Arg Val Gly Val Leu Ala
80 85 90

Lys Gly Leu Leu Leu Arg Gly Asp Arg Asn Val Asn Leu Val Leu
95 100 105

Leu Cys Ser Glu Lys Pro Ser Lys Thr Leu Leu Ser Arg Ile Ala
110 115 120

Glu Asn Leu Pro Lys Gln Leu Ala Val Ile Ser Pro Glu Lys Tyr
125 130 135

Asp Ile Lys Cys Ala Val Ser Glu Ala Ala Ile Ile Leu Asn Ser
140 145 150

Cys Val Glu Pro Lys Met Gln Val Thr Ile Thr Leu Thr Ser Pro
155 160 165

Ile Ile Arg Glu Glu Asn Met Arg Glu Gly Asp Val Thr Ser Gly
170 175 180

Met Val Lys Asp Pro Pro Asp Val Leu Asp Arg Gln Lys Cys Leu
185 190 195

Asp Ala Leu Ala Ala Leu Arg His Ala Lys Trp Phe Gln Ala Arg
200 205 210

Ala Asn Gly Leu Gln Ser Cys Val Ile Ile Ile Arg Ile Leu Arg
215 220 225

Asp Leu Cys Gln Arg Val Pro Thr Trp Ser Asp Phe Pro Ser Trp
230 235 240

Ala Met Glu Leu Leu Val Glu Lys Ala Ile Ser Ser Ala Ser Ser
245 250 255

Pro Gln Ser Pro Gly Asp Ala Leu Arg Arg Val Phe Glu Cys Ile
260 265 270

Ser Ser Gly Ile Ile Leu Lys Gly Ser Pro Gly Leu Leu Asp Pro
275 280 285

Cys Glu Lys Asp Pro Phe Asp Thr Leu Ala Thr Met Thr Asp Gln
290 295 300

Gln Arg Glu Asp Ile Thr Ser Ser Ala Gln Phe Ala Leu Arg Leu
305 310 315

Leu Ala Phe Arg Gln Ile His Lys Val Leu Gly Met Asp Pro Leu
320 325 330

Pro Gln Met Ser Gln Arg Phe Asn Ile His Asn Asn Arg Lys Arg
335 340 345

Arg Arg Asp Ser Asp Gly Val Asp Gly Phe Glu Ala Glu Gly Lys
350 355 360

Lys Asp Lys Lys Asp Tyr Asp Asn Phe
365

<210> 6
<211> 175
<212> PRT
<213> Homo sapiens
<220>
<221> misc_feature
<223> Incyte Identification No.: 002690CD1
<400> 6

Met Arg Leu Ser Val Ala Ala Ala Ile Ser His Gly Arg Val Phe
1 5 10 15

Arg Arg Met Gly Leu Gly Pro Glu Ser Arg Ile His Leu Leu Arg
20 25 30

Asn Leu Leu Thr Gly Leu Val Arg His Glu Arg Ile Glu Ala Pro
35 40 45

Trp Ala Arg Val Asp Glu Met Arg Gly Tyr Ala Glu Lys Leu Ile
50 55 60

Asp Tyr Gly Lys Leu Gly Asp Thr Asn Glu Arg Ala Met Arg Met
65 70 75

Ala Asp Phe Trp Leu Thr Glu Lys Asp Leu Ile Pro Lys Leu Phe
80 85 90

Gln Val Leu Ala Pro Arg Tyr Lys Asp Gln Thr Gly Gly Tyr Thr
95 100 105

Arg Met Leu Gln Ile Pro Asn Arg Ser Leu Asp Arg Ala Lys Met
110 115 120

Ala Val Ile Glu Tyr Lys Gly Asn Cys Leu Pro Pro Leu Pro Leu
125 130 135

Pro Arg Arg Asp Ser His Leu Thr Leu Leu Asn Gln Leu Leu Gln
140 145 150

Gly Leu Arg Gln Asp Leu Arg Gln Ser Gln Glu Ala Ser Asn His
155 160 165

Ser Ser His Thr Ala Gln Thr Pro Gly Ile
170 175

<210> 7
<211> 311
<212> PRT
<213> Homo sapiens
<220>
<221> misc_feature
<223> Incyte Identification No.: 041108CD1
<400> 7

Met Leu Gln Phe Val Arg Ala Gly Ala Arg Ala Trp Leu Arg Pro
1 5 10 15

Thr Gly Ser Gln Gly Leu Ser Ser Leu Ala Glu Glu Ala Ala Arg
20 25 30

Ala Thr Glu Asn Pro Glu Gln Val Ala Ser Glu Gly Leu Pro Glu
35 40 45

Pro Val Leu Arg Lys Val Glu Leu Pro Val Pro Thr His Arg Arg
50 55 60

Pro Val Gln Ala Trp Val Glu Ser Leu Arg Gly Phe Glu Gln Glu
65 70 75

Arg Val Gly Leu Ala Asp Leu His Pro Asp Val Phe Ala Thr Ala
80 85 90

Pro Arg Leu Asp Ile Leu His Gln Val Ala Met Trp Gln Lys Asn
95 100 105

Phe Lys Arg Ile Ser Tyr Ala Lys Thr Lys Thr Arg Ala Glu Val
110 115 120

Arg Gly Gly Gly Arg Lys Pro Trp Pro Gln Lys Gly Thr Gly Arg
125 130 135

Ala Arg His Gly Ser Ile Arg Ser Pro Leu Trp Arg Gly Gly Gly
140 145 150

Val Ala His Gly Pro Arg Gly Pro Thr Ser Tyr Tyr Tyr Met Leu
155 160 165

Pro Met Lys Val Arg Ala Leu Gly Leu Lys Val Ala Leu Thr Val
170 175 180

Lys Leu Ala Gln Asp Asp Leu His Ile Met Asp Ser Leu Glu Leu
185 190 195

Pro Thr Gly Asp Pro Gln Tyr Leu Thr Glu Leu Ala His Tyr Arg
200 205 210

Arg Trp Gly Asp Ser Val Leu Leu Val Asp Leu Thr His Glu Glu
215 220 225

Met Pro Gln Ser Ile Val Glu Ala Thr Ser Arg Leu Lys Thr Phe
230 235 240

Asn Leu Ile Pro Ala Val Gly Leu Asn Val His Ser Met Leu Lys
245 250 255

His Gln Thr Leu Val Leu Thr Leu Pro Thr Val Ala Phe Leu Glu
260 265 270

Asp Lys Leu Leu Trp Gln Asp Ser Arg Tyr Arg Pro Leu Tyr Pro
275 280 285

Phe Ser Leu Pro Tyr Ser Asp Phe Pro Arg Pro Leu Pro His Ala
290 295 300

Thr Gln Gly Pro Ala Ala Thr Pro Tyr His Cys
305 310

<210> 8
<211> 330
<212> PRT
<213> Homo sapiens
<220>
<221> misc_feature
<223> Incyte Identification No.: 869138CD1
<400> 8

Met Ser Thr Lys Asn Phe Arg Val Ser Asp Gly Asp Trp Ile Cys
1 5 10 15

Pro Asp Lys Lys Cys Gly Asn Val Asn Phe Ala Arg Arg Thr Ser
20 25 30

Cys Asn Arg Cys Gly Arg Glu Lys Thr Thr Glu Ala Lys Met Met
35 40 45

Lys Ala Gly Gly Thr Glu Ile Gly Lys Thr Leu Ala Glu Lys Ser
50 55 60

Arg Gly Leu Phe Ser Ala Asn Asp Trp Gln Cys Lys Thr Cys Ser
65 70 75

Asn Val Asn Trp Ala Arg Arg Ser Glu Cys Asn Met Cys Asn Thr
80 85 90

Pro Lys Tyr Ala Lys Leu Glu Glu Arg Thr Gly Tyr Gly Gly Gly
95 100 105

Phe Asn Glu Arg Glu Asn Val Glu Tyr Ile Glu Arg Glu Glu Ser
110 115 120

Asp Gly Glu Tyr Asp Glu Phe Gly Arg Lys Lys Lys Lys Tyr Arg
125 130 135

Gly Lys Ala Val Gly Pro Ala Ser Ile Leu Lys Glu Val Glu Asp
140 145 150

Lys Glu Ser Glu Gly Glu Glu Glu Asp Glu Asp Glu Asp Leu Ser
155 160 165

Lys Tyr Lys Leu Asp Glu Asp Glu Asp Glu Asp Asp Ala Asp Leu
170 175 180

Ser Lys Tyr Asn Leu Asp Ala Ser Glu Glu Glu Asp Ser Asn Lys
185 190 195

Lys Lys Ser Asn Arg Arg Ser Arg Ser Lys Ser Arg Ser Ser His
200 205 210

Ser Arg Ser Ser Ser Arg Ser Ser Ser Pro Ser Ser Ser Arg Ser
215 220 225

Arg Ser Arg Ser Arg Ser Arg Ser Ser Ser Ser Ser Gln Ser Arg
230 235 240

Ser Arg Ser Ser Ser Arg Glu Arg Ser Arg Ser Arg Gly Ser Lys
245 250 255

Ser Arg Ser Ser Ser Arg Ser His Arg Gly Ser Ser Ser Pro Arg
260 265 270

Lys Arg Ser Tyr Ser Ser Ser Ser Ser Ser Pro Glu Arg Asn Arg
275 280 285

Lys Arg Ser Arg Ser Arg Ser Ser Ser Ser Gly Asp Arg Lys Lys
290 295 300

Arg Arg Thr Arg Ser Arg Ser Pro Glu Arg Arg His Arg Ser Ser
305 310 315

Ser Gly Ser Ser His Ser Gly Ser Arg Ser Ser Ser Lys Lys Lys
320 325 330

<210> 9
<211> 183
<212> PRT
<213> Homo sapiens
<220>
<221> misc_feature
<223> Incyte Identification No.: 934406CD1
<400> 9

Met Ser Arg Tyr Leu Arg Pro Pro Asn Thr Ser Leu Phe Val Arg
1 5 10 15

Asn Val Ala Asp Asp Thr Arg Ser Glu Asp Leu Arg Arg Glu Phe
20 25 30

Gly Arg Tyr Gly Pro Ile Val Asp Val Tyr Val Pro Leu Asp Phe
35 40 45

Tyr Thr Arg Arg Pro Arg Gly Phe Ala Tyr Val Gln Phe Glu Asp
50 55 60

Val Arg Asp Ala Glu Asp Ala Leu His Asn Leu Asp Arg Lys Trp
65 70 75

Ile Cys Gly Arg Gln Ile Glu Ile Gln Phe Ala Gln Gly Asp Arg
80 85 90

Lys Thr Pro Asn Gln Met Lys Ala Lys Glu Gly Arg Asn Val Tyr
95 100 105

Ser Ser Ser Arg Tyr Asp Asp Tyr Asp Arg Tyr Arg Arg Ser Arg
110 115 120

Ser Arg Ser Tyr Glu Arg Arg Arg Ser Arg Ser Arg Ser Phe Asp
125 130 135

Tyr Asn Tyr Arg Arg Ser Tyr Ser Pro Arg Asn Ser Arg Pro Thr
140 145 150

Gly Arg Pro Arg Arg Ser Arg Ser His Ser Asp Asn Asp Arg Pro
155 160 165

Asn Cys Ser Trp Asn Thr Gln Tyr Ser Ser Ala Tyr Tyr Thr Ser
170 175 180

Arg Lys Ile

<210> 10
<211> 670
<212> PRT
<213> Homo sapiens
<220>
<221> misc_feature
<223> Incyte Identification No.: 1315083CD1
<400> 10

Met Ser His Leu Pro Met Lys Leu Leu Arg Lys Lys Ile Glu Lys
1 5 10 15

Arg Asn Leu Lys Leu Arg Gln Arg Asn Leu Lys Phe Gln Gly Ala
20 25 30

Ser Asn Leu Thr Leu Ser Glu Thr Gln Asn Gly Asp Val Ser Glu
35 40 45

Glu Thr Met Gly Ser Arg Lys Val Lys Lys Ser Lys Gln Lys Pro
50 55 60

Met Asn Val Gly Leu Ser Glu Thr Gln Asn Gly Gly Met Ser Gln
65 70 75

Glu Ala Val Gly Asn Ile Lys Val Thr Lys Ser Pro Gln Lys Ser
80 85 90

Thr Val Leu Thr Asn Gly Glu Ala Ala Met Gln Ser Ser Asn Ser
95 100 105

Glu Ser Lys Lys Lys Lys Lys Lys Lys Arg Lys Met Val Asn Asp
110 115 120

Ala Glu Pro Asp Thr Lys Lys Ala Lys Thr Glu Asn Lys Gly Lys
125 130 135

Ser Glu Glu Glu Ser Ala Glu Thr Thr Lys Glu Thr Glu Asn Asn
140 145 150

Val Glu Lys Pro Asp Asn Asp Glu Asp Glu Ser Glu Val Pro Ser
155 160 165

Leu Pro Leu Gly Leu Thr Gly Ala Phe Glu Asp Thr Ser Phe Ala
170 175 180

Ser Leu Cys Asn Leu Val Asn Glu Asn Thr Leu Lys Ala Ile Lys
185 190 195

Glu Met Gly Phe Thr Asn Met Thr Glu Ile Gln His Lys Ser Ile
200 205 210

Arg Pro Leu Leu Glu Gly Arg Asp Leu Leu Ala Ala Ala Lys Thr
215 220 225

Gly Ser Gly Lys Thr Leu Ala Phe Leu Ile Pro Ala Val Glu Leu
230 235 240

Ile Val Lys Leu Arg Phe Met Pro Arg Asn Gly Thr Gly Val Leu
245 250 255

Ile Leu Ser Pro Thr Arg Glu Leu Ala Met Gln Thr Phe Gly Val
260 265 270

Leu Lys Glu Leu Met Thr His His Val His Thr Tyr Gly Leu Ile
275 280 285

Met Gly Gly Ser Asn Arg Ser Ala Glu Ala Gln Lys Leu Gly Asn
290 295 300

Gly Ile Asn Ile Ile Val Ala Thr Pro Gly Arg Leu Leu Asp His
305 310 315

Met Gln Asn Thr Pro Gly Phe Met Tyr Lys Asn Leu Gln Cys Leu
320 325 330

Val Ile Asp Glu Ala Asp Arg Ile Leu Asp Val Gly Phe Glu Glu
335 340 345

Glu Leu Lys Gln Ile Ile Lys Leu Leu Pro Thr Arg Arg Gln Thr
350 355 360

Met Leu Phe Ser Ala Thr Gln Thr Arg Lys Val Glu Asp Leu Ala
365 370 375

Arg Ile Ser Leu Lys Lys Glu Pro Leu Tyr Val Gly Val Asp Asp
380 385 390

Asp Lys Ala Asn Ala Thr Val Asp Gly Leu Glu Gln Gly Tyr Val
395 400 405

Val Cys Pro Ser Glu Lys Arg Phe Leu Leu Leu Phe Thr Phe Leu
410 415 420

Lys Lys Asn Arg Lys Lys Lys Leu Met Val Phe Phe Ser Ser Cys
425 430 435

Met Ser Val Lys Tyr His Tyr Glu Leu Leu Asn Tyr Ile Asp Leu
440 445 450

Pro Val Leu Ala Ile His Gly Lys Gln Lys Gln Asn Lys Arg Thr
455 460 465

Thr Thr Phe Phe Gln Phe Cys Asn Ala Asp Ser Gly Thr Leu Leu
470 475 480

Cys Thr Asp Val Ala Ala Arg Gly Leu Asp Ile Pro Glu Val Asp
485 490 495

Trp Ile Val Gln Tyr Asp Pro Pro Asp Asp Pro Lys Glu Tyr Ile
500 505 510

His Arg Val Gly Arg Thr Ala Arg Gly Leu Asn Gly Arg Gly His
515 520 525

Ala Leu Leu Ile Leu Arg Pro Glu Glu Leu Gly Phe Leu Arg Tyr
530 535 540

Leu Lys Gln Ser Lys Val Pro Leu Ser Glu Phe Asp Phe Ser Trp
545 550 555

Ser Lys Ile Ser Asp Ile Gln Ser Gln Leu Glu Lys Leu Ile Glu
560 565 570

Lys Asn Tyr Phe Leu His Lys Ser Ala Gln Glu Ala Tyr Lys Ser
575 580 585

Tyr Ile Arg Ala Tyr Asp Ser His Ser Leu Lys Gln Ile Phe Asn
590 595 600

Val Asn Asn Leu Asn Leu Pro Gln Val Ala Leu Ser Phe Gly Phe
605 610 615

Lys Val Pro Pro Phe Val Asp Leu Asn Val Asn Ser Asn Glu Gly
620 625 630

Lys Gln Lys Lys Arg Gly Gly Gly Gly Gly Phe Gly Tyr Gln Lys
635 640 645

Thr Lys Lys Val Glu Lys Ser Lys Ile Phe Lys His Ile Ser Lys
650 655 660

Lys Ser Ser Asp Ser Arg Gln Phe Ser His
665 670

<210> 11
<211> 452
<212> PRT
<213> Homo sapiens
<220>
<221> misc_feature
<223> Incyte Identification No.: 1444908CD1
<400> 11

Met Glu Phe Gln Ala Val Val Met Ala Val Gly Gly Gly Ser Arg
1 5 10 15

Met Thr Asp Leu Thr Ser Ser Ile Pro Lys Pro Leu Leu Pro Val
20 25 30

Gly Asn Lys Pro Leu Ile Trp Tyr Pro Leu Asn Leu Leu Glu Arg
35 40 45

Val Gly Phe Glu Glu Val Ile Val Val Thr Thr Arg Asp Val Gln
50 55 60

Lys Ala Leu Cys Ala Glu Phe Lys Met Lys Met Lys Pro Asp Ile
65 70 75

Val Cys Ile Pro Asp Asp Ala Asp Met Gly Thr Ala Asp Ser Leu
80 85 90

Arg Tyr Ile Tyr Pro Lys Leu Lys Thr Asp Val Leu Val Leu Ser
95 100 105

Cys Asp Leu Ile Thr Asp Val Ala Leu His Glu Val Val Asp Leu
110 115 120

Phe Arg Ala Tyr Asp Ala Ser Leu Ala Met Leu Met Arg Lys Gly
125 130 135

Gln Asp Ser Ile Glu Pro Val Pro Gly Gln Lys Gly Lys Lys Lys
140 145 150

Ala Val Glu Gln Arg Asp Phe Ile Gly Val Asp Ser Thr Gly Lys
155 160 165

Arg Leu Leu Phe Met Ala Asn Glu Ala Asp Leu Asp Glu Glu Leu
170 175 180

Val Ile Lys Gly Ser Ile Leu Gln Lys His Pro Arg Ile Arg Phe
185 190 195

His Thr Gly Leu Val Asp Ala His Leu Tyr Cys Leu Lys Lys Tyr
200 205 210

Ile Val Asp Phe Leu Met Glu Asn Gly Ser Ile Thr Ser Ile Arg
215 220 225

Ser Glu Leu Ile Pro Tyr Leu Val Arg Lys Gln Phe Ser Ser Ala
230 235 240

Ser Ser Gln Gln Gly Gln Glu Glu Lys Glu Glu Asp Leu Lys Lys
245 250 255

Lys Glu Leu Lys Ser Leu Asp Ile Tyr Ser Phe Ile Lys Glu Ala
260 265 270

Asn Thr Leu Asn Leu Ala Pro Tyr Asp Ala Cys Trp Asn Ala Cys
275 280 285

Arg Gly Asp Arg Trp Glu Asp Leu Ser Arg Ser Gln Val Arg Cys
290 295 300

Tyr Val His Ile Met Lys Glu Gly Leu Cys Ser Arg Val Ser Thr
305 310 315

Leu Gly Leu Tyr Met Glu Ala Asn Arg Gln Val Pro Lys Leu Leu
320 325 330

Ser Ala Leu Cys Pro Glu Glu Pro Pro Val His Ser Ser Ala Gln
335 340 345

Ile Val Ser Lys His Leu Val Gly Val Asp Ser Leu Ile Gly Pro
350 355 360

Glu Thr Gln Ile Gly Glu Lys Ser Ser Ile Lys Arg Ser Val Ile
365 370 375

Gly Ser Ser Cys Leu Ile Lys Asp Arg Val Thr Ile Thr Asn Cys
380 385 390

Leu Leu Met Asn Ser Val Thr Val Glu Glu Gly Ser Asn Ile Gln
395 400 405

Gly Ser Val Ile Cys Asn Asn Ala Val Ile Glu Lys Gly Ala Asp
410 415 420

Ile Lys Asp Cys Leu Ile Gly Ser Gly Gln Arg Ile Glu Ala Lys
425 430 435

Ala Lys Arg Val Asn Glu Val Ile Val Gly Asn Asp Gln Leu Met
440 445 450

Glu Ile

<210> 12
<211> 748
<212> PRT
<213> Homo sapiens
<220>
<221> misc_feature
<223> Incyte Identification No.: 1557481CD1
<400> 12

Met Ala Asp Ser Ser Gly Gln Gln Gly Lys Gly Arg Arg Val Gln
1 5 10 15

Pro Gln Trp Ser Pro Pro Ala Gly Thr Gln Pro Cys Arg Leu His
20 25 30

Leu Tyr Asn Ser Leu Thr Arg Asn Lys Glu Val Phe Ile Pro Gln
35 40 45

Asp Gly Lys Lys Val Thr Trp Tyr Cys Cys Gly Pro Thr Val Tyr
50 55 60

Asp Ala Ser His Met Gly His Ala Arg Ser Tyr Ile Ser Phe Asp
65 70 75

Ile Leu Arg Arg Val Leu Lys Asp Tyr Phe Lys Phe Asp Val Phe
80 85 90

Tyr Cys Met Asn Ile Thr Asp Ile Asp Asp Lys Ile Ile Lys Arg
95 100 105

Ala Arg Gln Asn His Leu Phe Glu Gln Tyr Arg Glu Lys Arg Pro
110 115 120

Glu Ala Ala Gln Leu Leu Glu Asp Val Gln Ala Ala Leu Lys Pro
125 130 135

Phe Ser Val Lys Leu Asn Glu Thr Thr Asp Pro Asp Lys Lys Gln
140 145 150

Met Leu Glu Arg Ile Gln His Ala Val Gln Leu Ala Thr Glu Pro
155 160 165

Leu Glu Lys Ala Val Gln Ser Arg Leu Thr Gly Glu Glu Val Asn
170 175 180

Ser Cys Val Glu Val Leu Leu Glu Glu Ala Lys Asp Leu Leu Ser
185 190 195

Asp Trp Leu Asp Ser Thr Leu Gly Cys Asp Val Thr Asp Asn Ser
200 205 210

Ile Phe Ser Lys Leu Pro Lys Phe Trp Glu Gly Asp Phe His Arg
215 220 225

Asp Met Glu Ala Leu Asn Val Leu Pro Pro Asp Val Leu Thr Arg
230 235 240

Val Ser Glu Tyr Val Pro Glu Ile Val Asn Phe Val Gln Lys Ile
245 250 255

Val Asp Asn Gly Tyr Gly Tyr Val Ser Asn Gly Ser Val Tyr Phe
260 265 270

Asp Thr Ala Lys Phe Ala Ser Ser Glu Lys His Ser Tyr Gly Lys
275 280 285

Leu Val Pro Glu Ala Val Gly Asp Gln Lys Ala Leu Gln Glu Gly
290 295 300

Glu Gly Asp Leu Ser Ile Ser Ala Asp Arg Leu Ser Glu Lys Arg
305 310 315

Ser Pro Asn Asp Phe Ala Leu Trp Lys Ala Ser Lys Pro Gly Glu
320 325 330

Pro Ser Trp Pro Cys Pro Trp Gly Lys Gly Arg Pro Gly Trp His
335 340 345

Ile Glu Cys Ser Ala Met Ala Gly Thr Leu Leu Gly Ala Ser Met
350 355 360

Asp Ile His Gly Gly Gly Phe Asp Leu Arg Phe Pro His His Asp
365 370 375

Asn Glu Leu Ala Gln Ser Glu Ala Tyr Phe Glu Asn Asp Cys Trp
380 385 390

Val Arg Tyr Phe Leu His Thr Gly His Leu Thr Ile Ala Gly Cys
395 400 405

Lys Met Ser Lys Ser Leu Lys Asn Phe Ile Thr Ile Lys Asp Ala
410 415 420

Leu Lys Lys His Ser Ala Arg Gln Leu Arg Leu Ala Phe Leu Met
425 430 435

His Ser Trp Lys Asp Thr Leu Asp Tyr Ser Ser Asn Thr Met Glu
440 445 450

Ser Ala Leu Gln Tyr Glu Lys Phe Leu Asn Glu Phe Phe Leu Asn
455 460 465

Val Lys Asp Ile Leu Arg Ala Pro Val Asp Ile Thr Gly Gln Phe
470 475 480

Glu Lys Trp Gly Glu Glu Glu Ala Glu Leu Asn Lys Asn Phe Tyr
485 490 495

Asp Lys Lys Thr Ala Ile His Lys Ala Leu Cys Asp Asn Val Asp
500 505 510

Thr Arg Thr Val Met Glu Glu Met Arg Ala Leu Val Ser Gln Cys
515 520 525

Asn Leu Tyr Met Ala Ala Arg Lys Ala Val Arg Lys Arg Pro Asn
530 535 540

Gln Ala Leu Leu Glu Asn Ile Ala Leu Tyr Leu Thr His Met Leu
545 550 555

Lys Ile Phe Gly Ala Val Glu Glu Asp Ser Ser Leu Gly Phe Pro
560 565 570

Val Gly Gly Pro Gly Thr Ser Leu Ser Leu Glu Ala Thr Val Met
575 580 585

Pro Tyr Leu Gln Val Leu Ser Glu Phe Arg Glu Gly Val Arg Lys
590 595 600

Ile Ala Arg Glu Gln Lys Val Pro Glu Ile Leu Gln Leu Ser Asp
605 610 615

Ala Leu Arg Asp Asn Ile Leu Pro Glu Leu Gly Val Arg Phe Glu
620 625 630

Asp His Glu Gly Leu Pro Thr Val Val Lys Leu Val Asp Arg Asn
635 640 645

Thr Leu Leu Lys Glu Arg Glu Glu Lys Arg Arg Val Glu Glu Glu
650 655 660

Lys Arg Lys Lys Lys Glu Glu Ala Ala Arg Arg Lys Gln Glu Gln
665 670 675

Glu Ala Ala Lys Leu Ala Lys Met Lys Ile Pro Pro Ser Glu Met
680 685 690

Phe Leu Ser Glu Thr Asp Lys Tyr Ser Lys Phe Asp Glu Asn Gly
695 700 705

Leu Pro Thr His Asp Met Glu Gly Lys Glu Leu Ser Lys Gly Gln
710 715 720

Ala Lys Lys Leu Lys Lys Leu Phe Glu Ala Gln Glu Lys Leu Tyr
725 730 735

Lys Glu Tyr Leu Gln Met Ala Gln Asn Gly Ser Phe Gln
740 745

<210> 13
<211> 328
<212> PRT
<213> Homo sapiens
<220>
<221> misc_feature
<223> Incyte Identification No.: 1747456CD1
<400> 13



Met Glu Ala Asn Gly Ser Gln Gly Thr Ser Gly Ser Ala Asn Asp
1 5 10 15

Ser Gln His Asp Pro Gly Lys Met Phe Ile Gly Gly Leu Ser Trp
20 25 30

Gln Thr Ser Pro Asp Ser Leu Arg Asp Tyr Phe Ser Lys Phe Gly
35 40 45

Glu Ile Arg Glu Cys Met Val Met Arg Asp Pro Thr Thr Lys Arg
50 55 60

Ser Arg Gly Phe Gly Phe Val Thr Phe Ala Asp Pro Ala Ser Val
65 70 75

Asp Lys Val Leu Gly Gln Pro His His Glu Leu Asp Ser Lys Thr
80 85 90

Ile Asp Pro Lys Val Ala Phe Pro Arg Arg Ala Gln Pro Lys Met
95 100 105

Val Thr Arg Thr Lys Lys Ile Phe Val Gly Gly Leu Ser Ala Asn
110 115 120

Thr Val Val Glu Asp Val Lys Gln Tyr Phe Glu Gln Phe Gly Lys
125 130 135

Val Glu Asp Ala Met Leu Met Phe Asp Lys Thr Thr Asn Arg His
140 145 150

Arg Gly Phe Gly Phe Val Thr Phe Glu Asn Glu Asp Val Val Glu
155 160 165

Lys Val Cys Glu Ile His Phe His Glu Ile Asn Asn Lys Met Val
170 175 180

Glu Cys Lys Lys Ala Gln Pro Lys Glu Val Met Phe Pro Pro Gly
185 190 195

Thr Arg Gly Arg Ala Arg Gly Leu Pro Tyr Thr Met Asp Ala Phe
200 205 210

Met Leu Gly Met Gly Met Leu Gly Tyr Pro Asn Phe Val Ala Thr
215 220 225

Tyr Gly Arg Gly Tyr Pro Gly Phe Ala Pro Ser Tyr Gly Tyr Gln
230 235 240

Phe Pro Gly Phe Pro Ala Ala Ala Tyr Gly Pro Val Ala Ala Ala
245 250 255

Ala Val Ala Ala Ala Arg Gly Ser Gly Ser Asn Pro Ala Arg Pro
260 265 270

Gly Gly Phe Pro Gly Ala Asn Ser Pro Gly Pro Val Ala Asp Leu
275 280 285

Tyr Gly Pro Ala Ser Gln Asp Ser Gly Val Gly Asn Tyr Ile Ser
290 295 300

Ala Ala Ser Pro Gln Pro Gly Ser Gly Phe Gly His Gly Ile Ala
305 310 315

Gly Pro Leu Ile Ala Thr Ala Phe Thr Asn Gly Tyr His
320 325



<210> 14 

<211> 563 

<212> PRT 

<213> Homo sapiens 



<220> 

<221> misc_feature

<223> Incyte Identification No.: 1748626CD1



<400> 14 
Met Pro Glu Asp Asp Gln Arg Ala Thr Arg Asn Leu Phe Ile Gly
1 5 10 15

Asn Leu Asp His Ser Val Ser Glu Val Glu Leu Arg Arg Ala Phe
20 25 30

Glu Lys Tyr Gly Ile Ile Glu Glu Val Val Ile Lys Arg Pro Ala
35 40 45

Arg Gly Gln Gly Gly Ala Tyr Ala Phe Leu Lys Phe Gln Asn Leu
50 55 60

Asp Met Ala His Arg Ala Lys Val Ala Met Ser Gly Arg Val Ile
65 70 75

Gly Arg Asn Pro Ile Lys Ile Gly Tyr Gly Lys Ala Asn Pro Thr
80 85 90

Thr Arg Leu Trp Val Gly Gly Leu Gly Pro Asn Thr Ser Leu Ala
95 100 105

Ala Leu Ala Arg Glu Phe Asp Arg Phe Gly Ser Ile Arg Thr Ile
110 115 120

Asp His Val Lys Gly Asp Ser Phe Ala Tyr Ile Gln Tyr Glu Ser
125 130 135

Leu Asp Ala Ala Gln Ala Ala Cys Ala Lys Met Arg Gly Phe Pro
140 145 150

Leu Gly Gly Pro Asp Arg Arg Leu Arg Val Asp Phe Ala Lys Ala
155 160 165

Glu Glu Thr Arg Tyr Pro Gln Gln Tyr Gln Pro Ser Pro Leu Pro
170 175 180

Val His Tyr Glu Leu Leu Thr Asp Gly Tyr Thr Arg His Arg Asn
185 190 195

Leu Asp Ala Asp Leu Val Arg Asp Arg Thr Pro Pro His Leu Leu
200 205 210

Tyr Ser Asp Arg Asp Arg Thr Phe Leu Glu Gly Asp Trp Thr Ser
215 220 225

Pro Ser Lys Ser Ser Asp Arg Arg Asn Ser Leu Glu Gly Tyr Ser
230 235 240

Arg Ser Val Arg Ser Arg Ser Gly Glu Arg Trp Gly Ala Asp Gly
245 250 255

Asp Arg Gly Leu Pro Lys Pro Trp Glu Glu Arg Arg Lys Arg Arg
260 265 270

Ser Leu Ser Ser Asp Arg Gly Arg Thr Thr His Ser Pro Tyr Glu
275 280 285

Glu Arg Ser Arg Thr Lys Gly Ser Gly Gln Gln Ser Glu Arg Gly
290 295 300

Ser Asp Arg Thr Pro Glu Arg Ser Arg Lys Glu Asn His Ser Ser
305 310 315

Glu Gly Thr Lys Glu Ser Ser Ser Asn Ser Leu Ser Asn Ser Arg
320 325 330

His Gly Ala Glu Glu Arg Gly His His His His His His Glu Ala
335 340 345

Ala Asp Ser Ser His Gly Lys Lys Ala Arg Asp Ser Glu Arg Asn
350 355 360

His Arg Thr Thr Glu Ala Glu Pro Lys Pro Leu Glu Glu Pro Lys
365 370 375

His Glu Thr Lys Lys Leu Lys Asn Leu Ser Glu Tyr Ala Gln Thr
380 385 390

Leu Gln Leu Gly Trp Asn Gly Leu Leu Val Leu Lys Asn Ser Cys
395 400 405

Phe Pro Thr Ser Met His Ile Leu Glu Gly Asp Gln Gly Val Ile
410 415 420

Ser Ser Leu Leu Lys Asp His Thr Ser Gly Ser Lys Leu Thr Gln
425 430 435

Leu Lys Ile Ala Gln Arg Leu Arg Leu Asp Gln Pro Lys Leu Asp
440 445 450

Glu Val Thr Arg Arg Ile Lys Gln Gly Ser Pro Asn Gly Tyr Ala
455 460 465

Val Leu Leu Ala Thr Gln Ala Thr Pro Ser Gly Leu Gly Thr Glu
470 475 480

Gly Met Pro Thr Val Glu Pro Gly Leu Gln Arg Arg Leu Leu Arg
485 490 495

Asn Leu Val Ser Tyr Leu Lys Gln Lys Gln Ala Ala Gly Val Ile
500 505 510

Ser Leu Pro Val Gly Gly Ser Lys Gly Arg Asp Gly Thr Gly Met
515 520 525

Leu Tyr Ala Phe Pro Pro Cys Asp Phe Ser Gln Gln Tyr Leu Gln
530 535 540

Ser Ala Leu Arg Thr Leu Gly Lys Leu Glu Glu Glu His Met Val
545 550 555

Ile Val Ile Val Arg Asp Thr Ala
560



<210> 15 
<211> 153 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature

<223> Incyte Identification No.: 1879135CD1
<400> 15 

Met Met Ser Gln Ser Gly His Glu Tyr Asp Pro Ile Asn Tyr Met
1 5 10 15

Lys Lys Pro Leu Gly Pro Pro Pro Pro Ser Tyr Thr Cys Phe Arg
20 25 30

Cys Gly Lys Pro Gly His Tyr Ile Lys Asn Cys Pro Thr Asn Gly
35 40 45

Asp Lys Asn Phe Glu Ser Gly Pro Arg Ile Lys Lys Ser Thr Gly
50 55 60

Ile Pro Arg Ser Phe Met Met Glu Val Lys Asp Pro Asn Met Lys
65 70 75

Gly Ala Met Leu Thr Asn Thr Gly Lys Tyr Ala Ile Pro Thr Ile
80 85 90

Asp Ala Glu Ala Tyr Ala Ile Gly Lys Lys Glu Lys Pro Pro Phe
95 100 105

Leu Pro Glu Glu Pro Ser Ser Ser Ser Glu Glu Asp Asp Pro Ile
110 115 120

Pro Asp Glu Leu Leu Cys Leu Ile Cys Lys Asp Ile Met Thr Asp
125 130 135

Ala Val Val Ile Pro Cys Cys Gly Asn Ser Tyr Cys Asp Glu Cys
140 145 150

Lys Lys Cys



<210> 16
<211> 286 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte Identification No.: 2073417CD1
<400> 16 

Met Ser Trp Leu Leu Phe Leu Ala His Arg Val Ala Leu Ala Ala
1 5 10 15

Leu Pro Cys Arg Arg Gly Ser Arg Gly Phe Gly Met Phe Tyr Ala
20 25 30

Val Arg Arg Gly Arg Lys Thr Gly Val Phe Leu Thr Trp Asn Glu
35 40 45

Cys Arg Ala Gln Val Asp Arg Phe Pro Ala Ala Arg Phe Lys Lys
50 55 60

Phe Ala Thr Glu Asp Glu Ala Trp Ala Phe Val Arg Lys Ser Ala
65 70 75

Ser Pro Glu Val Ser Glu Gly His Glu Asn Gln His Gly Gln Glu
80 85 90

Ser Glu Ala Lys Ala Ser Lys Arg Leu Arg Glu Pro Leu Asp Gly
95 100 105

Asp Gly His Glu Ser Ala Glu Pro Tyr Ala Lys His Met Lys Pro
110 115 120

Ser Met Glu Pro Ala Pro Pro Val Ser Arg Asp Thr Phe Ser Tyr
125 130 135

Met Gly Asp Phe Val Val Val Tyr Thr Asp Gly Cys Cys Ser Ser
140 145 150

Asn Gly Arg Arg Arg Pro Arg Ala Gly Ile Gly Val Tyr Trp Gly
155 160 165

Pro Gly His Pro Leu Asn Val Gly Ile Arg Leu Pro Gly Arg Gln
170 175 180

Thr Asn Gln Arg Ala Glu Ile His Ala Ala Cys Lys Ala Ile Glu
185 190 195

Gln Ala Lys Thr Gln Asn Ile Asn Lys Leu Val Leu Tyr Thr Asp
200 205 210

Ser Met Phe Thr Ile Asn Gly Ile Thr Asn Trp Val Gln Gly Trp
215 220 225

Lys Lys Asn Gly Trp Lys Thr Ser Ala Gly Lys Glu Val Ile Asn
230 235 240

Lys Glu Asp Phe Val Ala Leu Glu Arg Leu Thr Gln Gly Met Asp
245 250 255

Ile Gln Trp Met His Val Pro Gly His Ser Gly Phe Ile Gly Asn
260 265 270

Glu Glu Ala Asp Arg Leu Ala Arg Glu Gly Ala Lys Gln Ser Glu
275 280 285

Asp



<210> 17 
<211> 537 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature

<223> Incyte Identification No.: 2129080CD1 
<400> 17 

Met Leu Ala Arg Glu Thr Tyr Glu Glu Asp Arg Glu Tyr Glu Ser
1 5 10 15

Gln Ala Lys Arg Leu Lys Thr Glu Glu Gly Glu Ile Asp Tyr Ser
20 25 30

Ala Glu Glu Gly Glu Asn Arg Arg Glu Ala Thr Pro Arg Gly Gly
35 40 45

Gly Asp Gly Gly Gly Gly Gly Arg Ser Phe Ser Gln Pro Glu Ala
50 55 60

Gly Gly Ser His His Lys Val Ser Val Ser Pro Val Val His Val
65 70 75

Arg Gly Leu Cys Glu Ser Val Val Glu Ala Asp Leu Val Glu Ala
80 85 90

Leu Glu Lys Phe Gly Thr Ile Cys Tyr Val Met Met Met Pro Phe
95 100 105

Lys Arg Gln Ala Leu Val Glu Phe Glu Asn Ile Asp Ser Ala Lys
110 115 120

Glu Cys Val Thr Phe Ala Ala Asp Glu Pro Val Tyr Ile Ala Gly
125 130 135

Gln Gln Ala Phe Phe Asn Tyr Ser Thr Ser Lys Arg Ile Thr Arg
140 145 150

Pro Gly Asn Thr Asp Asp Pro Ser Gly Gly Asn Lys Val Leu Leu
155 160 165

Leu Ser Ile Gln Asn Pro Leu Tyr Pro Ile Thr Val Asp Val Leu
170 175 180

Tyr Thr Val Cys Asn Pro Val Gly Lys Val Gln Arg Ile Val Ile
185 190 195

Phe Lys Arg Asn Gly Ile Gln Ala Met Val Glu Phe Glu Ser Val
200 205 210

Leu Cys Ala Gln Lys Ala Lys Ala Ala Leu Asn Gly Ala Asp Ile
215 220 225

Tyr Ala Gly Cys Cys Thr Leu Lys Ile Glu Tyr Ala Arg Pro Thr
230 235 240

Arg Leu Asn Val Ile Arg Asn Asp Asn Asp Ser Trp Asp Tyr Thr
245 250 255

Lys Pro Tyr Leu Gly Arg Arg Asp Arg Gly Lys Gly Arg Gln Arg
260 265 270

Gln Ala Ile Leu Gly Glu His Pro Ser Ser Phe Arg His Asp Gly
275 280 285

Tyr Gly Ser His Gly Pro Leu Leu Pro Leu Pro Ser Arg Tyr Arg
290 295 300

Met Gly Ser Arg Asp Thr Pro Glu Leu Val Ala Tyr Pro Leu Pro
305 310 315

Gln Ala Ser Ser Ser Tyr Met His Gly Gly Asn Pro Ser Gly Ser
320 325 330

Val Val Met Val Ser Gly Leu His Gln Leu Lys Met Asn Cys Ser
335 340 345

Arg Val Phe Asn Leu Phe Cys Leu Tyr Gly Asn Ile Glu Lys Val
350 355 360

Lys Phe Met Lys Thr Ile Pro Gly Thr Ala Leu Val Glu Met Gly
365 370 375

Asp Glu Tyr Ala Val Glu Arg Ala Val Thr His Leu Asn Asn Val
380 385 390

Lys Leu Phe Gly Lys Arg Leu Asn Val Cys Val Ser Lys Gln His
395 400 405

Ser Val Val Pro Ser Gln Ile Phe Glu Leu Glu Asp Gly Thr Ser
410 415 420

Ser Tyr Lys Asp Phe Ala Met Ser Lys Asn Asn Arg Phe Thr Ser
425 430 435

Ala Gly Gln Ala Ser Lys Asn Ile Ile Gln Pro Pro Ser Cys Val
440 445 450

Leu His Tyr Tyr Asn Val Pro Leu Cys Val Thr Glu Glu Thr Phe
455 460 465

Thr Lys Leu Cys Asn Asp His Glu Val Leu Thr Phe Ile Lys Tyr
470 475 480

Lys Val Phe Asp Ala Lys Pro Ser Ala Lys Thr Leu Ser Gly Leu
485 490 495

Leu Glu Trp Glu Cys Lys Thr Asp Ala Val Glu Ala Leu Thr Ala
500 505 510

Leu Asn His Tyr Gln Ile Arg Val Pro Asn Gly Ser Asn Pro Tyr
515 520 525

Thr Leu Lys Leu Cys Phe Ser Thr Ser Ser His Leu
530 535



<210> 18 
<211> 163 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature

<223> Incyte Identification No.: 2472867CD1 
<400> 18

Met Arg Ile Glu Lys Cys Tyr Phe Cys Ser Gly Pro Ile Tyr Pro
1 5 10 15

Gly His Gly Met Met Phe Val Arg Asn Asp Cys Lys Val Phe Arg
20 25 30

Phe Cys Lys Ser Lys Cys His Lys Asn Phe Lys Lys Lys Arg Asn
35 40 45

Pro Arg Lys Val Arg Trp Thr Lys Ala Phe Arg Lys Ala Ala Gly
50 55 60

Lys Glu Leu Thr Val Asp Asn Ser Phe Glu Phe Glu Lys Arg Arg
65 70 75

Asn Glu Pro Ile Lys Tyr Gln Arg Glu Leu Trp Asn Lys Thr Ile
80 85 90

Asp Ala Met Lys Arg Val Glu Glu Ile Lys Gln Lys Arg Gln Ala
95 100 105

Lys Phe Ile Met Asn Arg Leu Lys Lys Asn Lys Glu Leu Gln Lys
110 115 120

Val Gln Asp Ile Lys Glu Val Lys Gln Asn Ile His Leu Ile Arg
125 130 135

Ala Pro Leu Ala Gly Lys Gly Lys Gln Leu Glu Glu Lys Met Val
140 145 150

Gln Gln Leu Gln Glu Asp Val Asp Met Glu Asp Ala Pro
155 160







<210> 19 
<211> 178 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte Identification No.: 2764755CD1
<400> 19 

Met Ser Ser Phe Ser Arg Ala Pro Gln Gln Trp Ala Thr Phe Ala
1 5 10 15

Arg Ile Trp Tyr Leu Leu Asp Gly Lys Met Gln Pro Pro Gly Lys
20 25 30

Leu Ala Ala Met Ala Ser Ile Arg Leu Gln Gly Leu His Lys Pro
35 40 45

Val Tyr His Ala Leu Ser Asp Cys Gly Asp His Val Val Ile Met
50 55 60

Asn Thr Arg His Ile Ala Phe Ser Gly Asn Lys Trp Glu Gln Lys
65 70 75

Val Tyr Ser Ser His Thr Gly Tyr Pro Gly Gly Phe Arg Gln Val
80 85 90

Thr Ala Ala Gln Leu His Leu Arg Asp Pro Val Ala Ile Val Lys
95 100 105

Leu Ala Ile Tyr Gly Met Leu Pro Lys Asn Leu His Arg Arg Thr
110 115 120

Met Met Glu Arg Leu His Leu Phe Pro Asp Glu Tyr Ile Pro Glu
125 130 135

Asp Ile Leu Lys Asn Leu Val Glu Glu Leu Pro Gln Pro Arg Lys
140 145 150

Ile Pro Lys Arg Leu Asp Glu Tyr Thr Gln Glu Glu Ile Asp Ala
155 160 165

Phe Pro Arg Leu Trp Thr Pro Pro Glu Asp Tyr Arg Leu
170 175



<210> 20 
<211> 140 
<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte Identification No.: 2875939CD1
<400> 20 

Met Ala Glu Asn Arg Glu Pro Arg Gly Ala Val Glu Ala Glu Leu
1 5 10 15

Asp Pro Val Glu Tyr Thr Leu Arg Lys Arg Leu Pro Ser Arg Leu
20 25 30

Pro Arg Arg Pro Asn Asp Ile Tyr Val Asn Met Lys Thr Asp Phe
35 40 45

Lys Ala Gln Leu Ala Arg Cys Gln Lys Leu Leu Asp Gly Gly Ala
50 55 60

Arg Gly Gln Asn Ala Cys Ser Glu Ile Tyr Ile His Gly Leu Gly
65 70 75

Leu Ala Ile Asn Arg Ala Ile Asn Ile Ala Leu Gln Leu Gln Ala
80 85 90

Gly Ser Phe Gly Ser Leu Gln Val Ala Ala Asn Thr Ser Thr Val
95 100 105

Glu Leu Val Asp Glu Leu Glu Pro Glu Thr Asp Thr Arg Glu Pro
110 115 120

Leu Thr Arg Ile Arg Asn Asn Ser Ala Ile His Ile Arg Val Phe
125 130 135

Arg Val Thr Pro Lys
140



<210> 21 
<211> 209 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature

<223> Incyte Identification No.: 3591363CD1
<400> 21 

Met Ala Ala Ala Met Ala Ala Ser Ser Leu Thr Val Thr Leu Gly
1 5 10 15

Arg Leu Ala Ser Ala Cys Ser His Ser Ile Leu Arg Pro Ser Gly
20 25 30

Pro Gly Ala Ala Ser Leu Trp Ser Ala Ser Arg Arg Phe Asn Ser






35 40 45

Gln Ser Thr Ser Tyr Leu Pro Gly Tyr Val Pro Lys Thr Ser Leu
50 55 60

Ser Ser Pro Pro Trp Pro Glu Val Val Leu Pro Asp Pro Val Glu
65 70 75

Glu Thr Arg His His Ala Glu Val Val Lys Lys Val Asn Glu Met
80 85 90

Ile Val Thr Gly Gln Tyr Gly Arg Leu Phe Ala Val Val His Phe
95 100 105

Ala Ser Arg Gln Trp Lys Val Thr Ser Glu Asp Leu Ile Leu Ile
110 115 120

Gly Asn Glu Leu Asp Leu Ala Cys Gly Glu Arg Ile Arg Leu Glu
125 130 135

Lys Val Leu Leu Val Gly Ala Asp Asn Phe Thr Leu Leu Gly Lys
140 145 150

Pro Leu Leu Gly Lys Asp Leu Val Arg Val Glu Ala Thr Val Ile
155 160 165

Glu Lys Thr Glu Ser Trp Pro Arg Ile Ile Met Arg Phe Arg Lys
170 175 180

Arg Lys Asn Phe Lys Lys Lys Arg Ile Val Thr Thr Pro Gln Thr
185 190 195

Val Leu Arg Ile Asn Ser Ile Glu Ile Ala Pro Cys Leu Leu
200 205



<210> 22 
<211> 162 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature

<223> Incyte Identification No.: 3702292CD1
<400> 22 

Met Ala Pro Lys Ala Lys Glu Ala Pro Ala His Pro Lys Ala Glu
1 5 10 15

Ala Lys Ala Lys Ala Leu Lys Ala Lys Lys Ala Val Leu Lys Gly
20 25 30

Val Arg Ser His Thr Gln Lys Gln Lys Ile Arg Met Ser Leu Thr
35 40 45

Phe Arg Arg Pro Lys Thr Leu Arg Leu Arg Arg Gln Pro Arg Tyr
50 55 60

Pro Arg Lys Ser Thr Pro Arg Arg Asn Lys Leu Gly His Tyr Ala
65 70 75

Ile Ile Lys Phe Pro Leu Ala Thr Glu Ser Ala Val Lys Lys Ile
80 85 90

Glu Glu Asn Asn Thr Leu Val Phe Thr Val Asp Val Lys Ala Asn
95 100 105

Lys His Gln Ile Arg Gln Ala Val Lys Lys Leu Tyr Asp Ser Asp
110 115 120

Val Ala Lys Val Thr Thr Leu Ile Cys Pro Asp Lys Glu Asn Lys
125 130 135

Ala Tyr Val Arg Leu Ala Pro Asp Tyr Asp Ala Phe Asp Val Val
140 145 150

Thr Lys Leu Gly Ser Pro Lys Leu Ser Pro Ala Gly
155 160



<210> 23 
<211> 623 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> misc_feature

<223> Incyte Identification No.: 3778908CD1



<400> 23 
Met Ala Thr Glu His Val Asn Gly Asn Gly Thr Glu Glu Pro Met
1 5 10 15

Asp Thr Thr Ser Ala Val Ile His Ser Glu Asn Phe Gln Thr Leu
20 25 30

Leu Asp Ala Gly Leu Pro Gln Lys Val Ala Glu Lys Leu Asp Glu
35 40 45

Ile Tyr Val Ala Gly Leu Val Ala His Ser Asp Leu Asp Glu Arg
50 55 60

Ala Ile Glu Ala Leu Lys Glu Phe Asn Glu Asp Gly Ala Leu Ala
65 70 75

Val Leu Gln Gln Phe Lys Asp Ser Asp Leu Ser His Val Gln Asn
80 85 90

Lys Ser Ala Phe Leu Cys Gly Val Met Lys Thr Tyr Arg Gln Arg
95 100 105

Glu Lys Gln Gly Thr Lys Val Ala Asp Ser Ser Lys Gly Pro Asp
110 115 120

Glu Ala Lys Ile Lys Ala Leu Leu Glu Arg Thr Gly Tyr Thr Leu
125 130 135

Asp Val Thr Thr Gly Gln Arg Lys Tyr Gly Gly Pro Pro Pro Asp
140 145 150

Ser Val Tyr Ser Gly Gln Gln Pro Ser Val Gly Thr Glu Ile Phe
155 160 165

Val Gly Lys Ile Pro Arg Asp Leu Phe Glu Asp Glu Leu Val Pro
170 175 180

Leu Phe Glu Lys Ala Gly Pro Ile Trp Asp Leu Arg Leu Met Met
185 190 195

Asp Pro Leu Thr Gly Leu Asn Arg Gly Tyr Ala Phe Val Thr Phe
200 205 210

Cys Thr Lys Glu Ala Ala Gln Glu Ala Val Lys Leu Tyr Asn Asn
215 220 225

His Glu Ile Arg Ser Gly Lys His Ile Gly Val Cys Ile Ser Val
230 235 240

Ala Asn Asn Arg Leu Phe Val Gly Ser Ile Pro Lys Ser Lys Thr
245 250 255

Lys Glu Gln Ile Leu Glu Glu Phe Ser Lys Val Thr Glu Gly Leu
260 265 270

Thr Asp Val Ile Leu Tyr His Gln Pro Asp Asp Lys Lys Lys Asn
275 280 285

Arg Gly Phe Cys Phe Leu Glu Tyr Glu Asp His Lys Thr Ala Ala
290 295 300

Gln Ala Arg Arg Arg Leu Met Ser Gly Lys Val Lys Val Trp Gly
305 310 315

Asn Val Gly Thr Val Glu Trp Ala Asp Pro Ile Glu Asp Pro Asp
320 325 330

Pro Glu Val Met Ala Lys Val Lys Val Leu Phe Val Arg Asn Leu
335 340 345

Ala Asn Thr Val Thr Glu Glu Ile Leu Glu Lys Ala Phe Ser Gln
350 355 360

Phe Gly Lys Leu Glu Arg Val Lys Lys Leu Lys Asp Tyr Ala Phe
365 370 375

Ile His Phe Asp Glu Arg Asp Gly Ala Val Lys Ala Met Glu Glu
380 385 390

Met Asn Gly Lys Asp Leu Glu Gly Glu Asn Ile Glu Ile Val Phe
395 400 405

Ala Lys Pro Pro Asp Gln Lys Arg Lys Glu Arg Lys Ala Gln Arg
410 415 420

Gln Ala Ala Lys Asn Gln Met Tyr Asp Asp Tyr Tyr Tyr Tyr Gly
425 430 435

Pro Pro His Met Pro Pro Pro Thr Arg Gly Arg Gly Arg Gly Gly
440 445 450

Arg Gly Gly Tyr Gly Tyr Pro Pro Asp Tyr Tyr Gly Tyr Glu Asp
455 460 465

Tyr Tyr Asp Tyr Tyr Gly Tyr Asp Tyr His Asn Tyr Arg Gly Gly
470 475 480

Tyr Glu Asp Pro Tyr Tyr Gly Tyr Glu Asp Phe Gln Val Gly Ala
485 490 495

Arg Gly Arg Gly Gly Arg Gly Ala Arg Gly Ala Ala Pro Ser Arg
500 505 510

Gly Arg Gly Ala Ala Pro Pro Arg Gly Arg Ala Gly Tyr Ser Gln
515 520 525

Arg Gly Gly Pro Gly Ser Ala Arg Gly Val Arg Gly Ala Arg Gly
530 535 540

Gly Ala Gln Gln Gln Arg Gly Arg Gly Val Arg Gly Ala Arg Gly
545 550 555

Gly Arg Gly Gly Asn Val Gly Gly Lys Arg Lys Ala Asp Gly Tyr
560 565 570

Asn Gln Pro Asp Ser Lys Arg Arg Gln Thr Asn Asn Gln Asn Trp
575 580 585

Gly Ser Gln Pro Ile Ala Gln Gln Pro Leu Gln Gly Gly Asp His
590 595 600

Ser Gly Asn Tyr Gly Tyr Lys Ser Glu Asn Gln Glu Phe Tyr Gln
605 610 615

Asp Thr Phe Gly Gln Gln Trp Lys
620



<210> 24 

<211> 786 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature

<223> Incyte Identification No.: 4163642CD1
<400> 24 



Met Ser Phe Ser Arg Ala Leu Leu Trp Ala Arg Leu Pro Ala Gly
1 5 10 15

Arg Gln Ala Gly His Arg Ala Ala Ile Cys Ser Ala Leu Arg Pro
20 25 30

His Phe Gly Pro Phe Pro Gly Val Leu Gly Gln Val Ser Val Leu
35 40 45

Ala Thr Ala Ser Ser Ser Ala Ser Gly Gly Ser Lys Ile Pro Asn
50 55 60

Thr Ser Leu Phe Val Pro Leu Thr Val Lys Pro Gln Gly Pro Ser
65 70 75

Ala Asp Gly Asp Val Gly Ala Glu Leu Thr Arg Pro Leu Asp Lys
80 85 90

Asn Glu Val Lys Lys Val Leu Asp Lys Phe Tyr Lys Arg Lys Glu
95 100 105

Ile Gln Lys Leu Gly Ala Asp Tyr Gly Leu Asp Ala Arg Leu Phe
110 115 120

His Gln Ala Phe Ile Ser Phe Arg Asn Tyr Ile Met Gln Ser His
125 130 135

Ser Leu Asp Val Asp Ile His Ile Val Leu Asn Asp Ile Cys Phe
140 145 150

Gly Ala Ala His Ala Asp Asp Leu Phe Pro Phe Phe Leu Arg His
155 160 165

Ala Lys Gln Ile Phe Pro Val Leu Asp Cys Lys Asp Asp Leu Arg
170 175 180

Lys Ile Ser Asp Leu Arg Ile Pro Pro Asn Trp Tyr Pro Asp Ala
185 190 195

Arg Ala Met Gln Arg Lys Ile Ile Phe His Ser Gly Pro Thr Asn
200 205 210

Ser Gly Lys Thr Tyr His Ala Ile Gln Lys Tyr Phe Ser Ala Lys
215 220 225

Ser Gly Val Tyr Cys Gly Pro Leu Lys Leu Leu Ala His Glu Ile
230 235 240

Phe Glu Lys Ser Asn Ala Ala Gly Val Pro Cys Asp Leu Val Thr
245 250 255

Gly Glu Glu Arg Val Thr Val Gln Pro Asn Gly Lys Gln Ala Ser
260 265 270

His Val Ser Cys Thr Val Glu Met Cys Ser Val Thr Thr Pro Tyr
275 280 285

Glu Val Ala Val Ile Asp Glu Ile Gln Met Ile Arg Asp Pro Ala
290 295 300

Arg Gly Trp Ala Trp Thr Arg Ala Leu Leu Gly Leu Cys Ala Glu
305 310 315

Glu Val His Leu Cys Gly Glu Pro Ala Ala Ile Asp Leu Val Met
320 325 330

Glu Leu Met Tyr Thr Thr Gly Glu Glu Val Glu Val Arg Asp Tyr
335 340 345

Lys Arg Leu Thr Pro Ile Ser Val Leu Asp His Ala Leu Glu Ser
350 355 360

Leu Asp Asn Leu Arg Pro Gly Asp Cys Ile Val Cys Phe Ser Lys
365 370 375

Asn Asp Ile Tyr Ser Val Ser Arg Gln Ile Glu Ile Arg Gly Leu
380 385 390

Glu Ser Ala Val Ile Tyr Gly Ser Leu Pro Pro Gly Thr Lys Leu
395 400 405

Ala Gln Ala Lys Lys Phe Asn Asp Pro Asn Asp Pro Cys Lys Ile
410 415 420

Leu Val Ala Thr Asp Ala Ile Gly Met Gly Leu Asn Leu Ser Ile
425 430 435

Arg Arg Ile Ile Phe Tyr Ser Leu Ile Lys Pro Ser Ile Asn Glu
440 445 450

Lys Gly Glu Arg Glu Leu Glu Pro Ile Thr Thr Ser Gln Ala Leu
455 460 465

Gln Ile Ala Gly Arg Ala Gly Arg Phe Ser Ser Arg Phe Lys Glu
470 475 480

Gly Glu Val Thr Thr Met Asn His Glu Asp Leu Ser Leu Leu Lys
485 490 495

Glu Ile Leu Lys Arg Pro Val Asp Pro Ile Arg Ala Ala Gly Leu
500 505 510

His Pro Thr Ala Glu Gln Ile Glu Met Phe Ala Tyr His Leu Pro
515 520 525

Asp Ala Thr Leu Ser Asn Leu Ile Asp Ile Phe Val Asp Phe Ser
530 535 540

Gln Val Asp Gly Gln Tyr Phe Val Cys Asn Met Asp Asp Phe Lys
545 550 555

Phe Ser Ala Glu Leu Ile Gln His Ile Pro Leu Ser Leu Arg Val
560 565 570

Arg Tyr Val Phe Cys Thr Ala Pro Ile Asn Lys Lys Gln Pro Phe
575 580 585

Val Cys Ser Ser Leu Leu Gln Phe Ala Arg Gln Tyr Ser Arg Asn
590 595 600

Glu Pro Leu Thr Phe Ala Trp Leu Arg Arg Tyr Ile Lys Trp Pro
605 610 615

Leu Leu Pro Pro Lys Asn Ile Lys Asp Leu Met Asp Leu Glu Ala
620 625 630

Val His Asp Val Leu Asp Leu Tyr Leu Trp Leu Ser Tyr Arg Phe
635 640 645

Met Asp Met Phe Pro Asp Ala Ser Leu Ile Arg Asp Leu Gln Lys
650 655 660

Glu Leu Asp Gly Ile Ile Gln Asp Gly Val His Asn Ile Thr Lys
665 670 675

Leu Ile Lys Met Ser Glu Thr His Lys Leu Leu Asn Leu Glu Gly
680 685 690

Phe Pro Ser Gly Ser Gln Ser Arg Leu Ser Gly Thr Leu Lys Ser
695 700 705

Gln Ala Arg Arg Thr Arg Gly Thr Lys Ala Leu Gly Ser Lys Ala
710 715 720

Thr Glu Pro Pro Ser Pro Asp Ala Gly Glu Leu Ser Leu Ala Ser
725 730 735

Arg Leu Val Gln Gln Gly Leu Leu Thr Pro Asp Met Leu Lys Gln
740 745 750

Leu Glu Lys Glu Trp Met Thr Gln Gln Thr Glu His Asn Lys Glu
755 760 765

Lys Thr Glu Ser Gly Thr His Pro Lys Gly Thr Arg Arg Lys Lys
770 775 780

Lys Glu Pro Asp Ser Asp
785



<210> 25 

<211> 260 

<212> PRT 

<213> Homo sapiens 
<220>

<221> misc_feature 

<223> Incyte Identification No.: 4906154CD1 
<400> 25 

Met Thr Pro Val Gln Arg Gly Gly Pro Gly Ala Leu Val Ala Leu
1 5 10 15

Gly Trp Gly Arg Arg Lys Ala Glu Asp Lys Glu Trp Met Pro Val
20 25 30

Thr Lys Leu Gly Arg Leu Val Lys Asp Met Lys Ile Lys Ser Leu
35 40 45

Glu Glu Ile Tyr Leu Phe Ser Leu Pro Ile Lys Glu Ser Glu Ile
50 55 60

Ile Asp Phe Phe Leu Gly Ala Ser Leu Lys Asp Glu Val Leu Lys
65 70 75

Ile Met Pro Val Gln Lys Gln Thr Arg Ala Gly Gln Arg Thr Arg
80 85 90

Phe Lys Ala Phe Val Ala Ile Gly Asp Tyr Asn Gly His Val Gly
95 100 105

Leu Gly Val Lys Cys Ser Lys Glu Val Ala Thr Ala Ile Arg Gly
110 115 120

Ala Ile Ile Leu Ala Lys Leu Ser Ile Val Pro Val Arg Arg Gly
125 130 135

Tyr Trp Gly Asn Lys Ile Gly Lys Pro His Thr Val Pro Cys Lys
140 145 150

Val Thr Gly Arg Cys Gly Ser Val Leu Val Arg Leu Ile Pro Ala
155 160 165

Pro Arg Gly Thr Gly Ile Val Ser Ala Pro Val Pro Lys Lys Leu
170 175 180

Leu Met Met Ala Gly Ile Asp Asp Cys Tyr Thr Ser Ala Arg Gly
185 190 195

Cys Thr Ala Thr Leu Gly Asn Phe Ala Lys Ala Thr Phe Asp Ala
200 205 210

Ile Ser Lys Thr Tyr Ser Tyr Leu Thr Pro Asp Leu Trp Lys Glu
215 220 225

Thr Val Phe Thr Lys Ser Pro Tyr Gln Glu Phe Thr Asp His Leu
230 235 240

Val Lys Thr His Thr Arg Val Ser Val Gln Arg Thr Gln Ala Pro
245 250 255

Ala Val Ala Thr Thr
260



<210> 26 
<211> 1872 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature

<223> Incyte Identification No.: 399781CB1 
<400> 26 

gccctctagc tgtgtgtgtc tgaggctcgg ccgcctgagc cgcggacggt ttgctgagcc 60
cgttagtgcg cccggccgag acacgccgcc gccatgtccc gctacctgcg tccccccaac 120
acgtctctgt tcgtcaggaa cgtggccgac gacaccaggt ctgaagactt gcggcgtgaa 180

tttggtcgtt atggtcctat agttgatgtg tatgttccac ttgatttcta cactcgccgt 240 

ccaagaggat ttgcttatgt tcaatttgag gatgttcgtg atgctgaaga cgctttacat 300

aatttggaca gaaagtggat ttgtggacgg cagattgaaa tacagtttgc ccagggggat 360 

cgaaagacac caaatcagat gaaagccaag gaagggagga atgtgtacag ttcttcacgc 420

tatgatgatt atgacagata cagacgttct agaagccgaa gttatgaaag gaggagatca 480

agaagtcggt cttttgatta caactataga agatcgtata gtcctagaaa cagtagaccg 540 

actggaagac cacggcgtag agaagccatt ccgacaatga tagaccaaac tgcagctgga 600 

atacccagta cagttctgct tactacactt caagaaagat ctgaaagcgg aaaaagaacc 660 

aaagaagggc agttcaagcg accaaagggt gggtggaagg tgctgcagta tgaatactgt 720 

acgaatattt tgactctggt ctgaaaagat aaaagaatgt tatcgaaaac tacatggaat 780 

aattgaagtc ccttcaagtt tgaaagtaag cattttagga caaataaaag gaaattcaac 840 

tttgtacttg tggaaactaa tccctaaata tgaataggtt tatattgatt catgggtaac 900 

aggtccataa taaattattg gaaactagga tgtctgaata tcaaggaaga cagccatagt 960 

ctcttacagt gcctctgttg gtctgtctca aactgaattg ggtgggaaaa ggtatggtcc 1020 

aatataaaag ttccattttt gccattattg gcaaatcttg cctttgttta ttttggtgcc 1080 

agtgttttct gcttaatcat ttgctttgtt ggcatctgtg tttatttact tgtacaccac 1140 

atgcagttta catctgtctt aactactcct tcccaggtaa attccaatta tatttgacat 1200 

ccagctaaga gggcccatct cttctcacct ctttcctagt cagtatattc agcaaatatt 1260 

tattgagccc ttactgtggg caaatcattg tactggataa ttgagaaaaa tagataattc 1320 

ccttattcag taaatgtcta ctgagcacaa tctagtgaat cattacagta tggcctcatt 1380 

gttttgtttg aggtgtgtta ttcataacaa tattttacac cattcgtatc aatgtaatta 1440 

tagaacacaa tatacgatca aggataagta attgtgtggt tatctgccat ttaaaagtat 1500 

ccagtatttg atcacattat tataaataat gaaaaaatga tttaatctgt aataaactgg 1560

tttattgtgc agtgactgta atatactaga gttataataa attgtttact ctgcctcacc 1620 

aaacacatgc taggatataa cccccaaaat aagtatttaa ctttgcatta ggtataaagg 1680 

agactgggtg ctataattag attattttga ggcagacaga gagctgttat cctaactgat 1740 

ttagtatgtt ctgtaattga gaaaatgttc accaaattat actttttagt gatttacatg 1800 

tacattttat aggggacatg ttctgtgtat agcgaataaa taacttttat agtatcaaaa 1860 
aaaaaaaaaa aa 1872 



<210> 27 
<211> 3834 
<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte Identification No.: 1806542CB1



<400> 27 

ctgcggcggc ggcggcggcg ttaccggccc tcgcgctctt tcccttcctg gggcgccgac 60
cccgcccgct tgcttgcttg cttgcttgcc tgcctgcctg cctgcccggc gccacgcaag 120
agaaggtgcc ^gggg^cgca gagcgactag aggcgcgcgg tcccggccag caccgtctct 180
ggcgttgtag ctgcggccgt ggcggaggac tacggcgaca aggacgaggg ccgctctccc 240
agctctctgc gtgccgcgcc gctccgctcc gctggctgac catctggagt gcaggctggg 300
aggcgggatg gagtgatagg gaagatgttt ataaattctt ctgtgggatc agagggcacg 360
cctattacaa ccagaaaact acaagtataa cagcgaggat ggatgaacag gctctattag 420
ggctaaatcc aaatgctgat tcagacttta gacaaagggc cctggcctat tttgagcagt 480
taaaaatttc cccagatgcc tggcaggtgt gtgcagaagc tctagcccag aggacataca 540
gtgatgatca tgtgaagttt ttctgctttc aagtactgga acatcaagtt aaatacaaat 600
actcagaact aaccactgtt caacaacagc taattaggga gacgctcata tcatggctgc 660
aagctcagat gctgaatccc caaccagaga agacctttat acgaaataaa gccgcccaag 720
tcttcgcctt gctttttgtt acagagtatc tcactaagtg gcccaagttt ttttttgaca 780
ttctctcagt agtggaccta aatccaaggg gagtagatct ctacctgcga atcctcatgg 840
ctattgattc agagttggtg gatcgtgatg tggtgcatac atcagaggag gctcgtagga 900
atactctcat aaaagatacc atgagggaac agtgcattcc aaatctggtg gaatcatggt 960
accaaatatt acaaaattat cagtttacta attctgaagt gacgtgtcag tgccttgaag 1020
tagttggggc ttatgtctct tggatagact tatcccttat agccaatgat aggtttataa 1080
atatgctgct aggtcatatg tcaatagaag ttctacggga agaagcatgt gactgtttat 1140
ttgaagttgt aaataaagga atggaccctg ttgataaaat gaaactagtg gaatctttgt 1200
gtcaagtatt acagtctgct gggtttttca gcattgacca ggaagaagat gttgacttcc 1260
tggccagatt ttctaagttg gtaaatggaa tgggacagtc attgatagtt agttggagta 1320
aattaattaa gaatggggat attaagaatg ctcaagaggc actacaagct attgaaacaa 1380
aagtggcact gatgttgcag ctactaattc atgaggatga tgatatttct tctaatatta 1440
ttggattttg ttacgattat cttcatattt tgaaacagct tacagtgctc tcggatcagc 1500
aaaaagctaa tgtagaggca atcatgttgg ccgttatgaa aaaattgact tacgatgaag 1560
aatataactt tgaaaatgag ggtgaagatg aagccatgtt tgtagaatat agaaaacaac 1620
tgaagttact gttggacagg cttgctcaag tttcaccaga gttactactg gcctctgttc 1680
gcagagtttt tagttctaca ctgcagaatt ggcagactac acggtttatg gaagttgaag 1740
tagcaataag attgctgtat atgttggcag aagctcttcc agtatctcat ggtgctcact 1800
tctcaggtga tgtttcaaaa gctagtgctt tgcaggatat gatgcgaact ctggtaacat 1860
caggagtcag ttcctatcag catacatctg tgacattgga gttcttcgaa actgttgtta 1920
gatatgaaaa gtttttcaca gttgaacctc agcacattcc atgtgtacta atggctttct 1980
tagatcacag aggtctgcgg cattccagtg caaaagttcg gagcaggacg gcttacctgt 2040
tttctagatt tgtcaaatct ctcaataagc aaatgaatcc tttcattgag gatattttga 2100
atagaataca agatttatta gagctttctc cacctgagaa tggccaccag tccttactga 2160
gcagcgatga tcaacttttt atttatgaga cagctggagt gctgattgtt aatagtgaat 2220
atccggcaga aaggaaacaa gccttaatga ggaatctgtt gactccacta atggagaagt 2280
ttaaaattct gttagaaaag ttgatgctgg cacaagatga agaaaggcaa gcctctctag 2340
cagactgtct taaccatgct gttggatttg caagtcgaac cagtaaagct ttcagcaaca 2400
aacagactgt gaaacaatgt ggctgttccg aagtttatct ggactgttta cagacattct 2460
tgccagccct cagttgtccc ttacaaaagg atattctcag aagtggagtc cgtactttcc 2520
ttcatcgaat gattatttgc ctggaggaag aagttcttcc gttcattcca tctgcttcag 2580
aacatatgct caaagattgt gaagcaaaag atctccagga gttcattcct cttatcaacc 2640
agattacggc caaattcaag atacaggtat ccccgttttt acaacagatg ttcatgcccc 2700
tgcttcatgc aatttttgaa gtgctgctcc ggccagcaga agaaaatgac cagtctgctg 2760
ctttagagaa gcagatgttg cggaggagtt actttgcttt cctgcaaaca gtcacaggca 2820
gtgggatgag cgaagttata gcaaatcaag gtgcagagaa tgtagaaaga gtgttggtta 2880
ctgttatcca aggagcagtt gaatatccag atccaattgc acagaaaaca tgttttatca 2940
tcctctcaaa gttggtagaa ctctggggag gtaaagatgg accagtggga tttgctgatt 3000
ttgtttataa gcacattgtc cccgcatgtt tcctagcacc tttaaaacaa acctttgacc 3060
tggcagatgc acaaacagta ttggctttat ctgagtgtgc agtgacactg aaaacaattc 3120
atctcaaacg gggcccagaa tgtgttcagt atcttcaaca agaatacctg ccctccttgc 3180
aagtagctcc agaaataatt caggagtttt gtcaagcgct tcagcagcct gatgctaaag 3240
tttttaaaaa ttacttaaag gtgttcttcc agagagcaaa gccctgagga ctggatttcc 3300
ctgtgcctac ttcatgafcca tgaattccag ttaatttata aagaggcgat ttttgtgtgc 3360
cattcacact ggtctttttc acattgtttt gagcttattg cagtatatgt tttgggattt 3420
ttctgtaaaa tgggtgtaat tttcctaata caggtatgta acaacaaaag aagttgcctg 3480
catgccggtc caaattgttc tgtataaaga tgctcttaaa agacacaaga gttatcctag 3540
aaccttaatt cttttttatt tgaaatttta agtcaagtcc tttataaaga ccatagcagt 3600
ggaaaacagt gtacttttta aaaaattgct gaatataaaa tctttgaaaa ttttctttat 3660
gtgtgaagac acaaagtatg ggggaagaca gcaatcaaaa ctaacttttt gtagatagcc 3720
atttcatttc tttaaactgt ttcaacgcca atatgtattc tacaaaagag aatggtttta 3780
ggctccagtg ttatactttt ttttatatat atatataaaa ataaacttta cgtt 3834



<210> 28 

<211> 2178

<212> DNA 

<213> Homo sapiens 

<220> 






<221> misc_feature

<223> Incyte Identification No.: 2263514CB1

<400> 28

cagctcccta agcggttgtc accgctggag acggttggga gaaccgttgt ggcgagcgct 60
acacgaggca aacgacttct cccttctttg aactggaccc cgcgagcacc agagtcggcg 120
taactatcgc ctgacaggca tttaaatcaa acggtattga gatggattgg gttatgaaac 180
ataatggtcc aaatgacgct agtgatggga cagtacgact tcgtggacta ccatttggtt 240
gcagcaaaga ggaaatagtt cgagttcttt caaggtatat tgagatcttc agaagtagca 300
ggagtgaaat caaaggattt tatgatccac caagaagatt gctgggacag cgaccgggac 360
catatgatag accaatagga ggaagagggg gttattatgg agctgggcgt ggaagttatg 420
gaggttttga tgactatggt ggctataata attacggcta tgggaatgat ggctttgatg 480
acagaatgag agatggaaga ggtatgggag gacatggcta tggtggagct ggtgatgcaa 540
gttcaggttt tcatggtggt catttcgtac atatgagagg gttgcctttt cgtgcaactg 600
aaaatgccat tgctaatttc ttctcaccac taaatccaat acgagttcat attgatattg 660
gagctgatgg cagagccaca ggagaagcag atgtagagtt tgtgacacat gaagatgcag 720
tagctgccat gtctaaagat aaaaataaca tgcaacatcg atatattgaa ctcttcttga 780
attctactcc tggaggcggc tctggcatgg gaggttctgg aatgggaggc tacggaagag 840
atggaatgga taatcaggga ggctatggat cagttggaag aatgggaatg gggaacaatt 900
acagtggagg atatggtact cctgatggtt tgggtggtta tggccgtggt ggtggaggca 960
gtggaggtta ctatgggcaa ggcggcatga gtggaggtgg atggcgtggg atgtactgaa 1020
agcaaaaaca ccaacataca agtcttgaca acagcatctg gtctactaga ctttcttaca 1080
gatttaattt cttttgtatt ttaagaactt tataatgact gaaggaatgt gttttcaaaa 1140
tattatttgg taaagcaaca gattgtgatg ggaaaatgtt ttctgtaggt ttatttgttg 1200
catactttga cttaaaaata aatttttata ttcaaaccac tgatgttgat actttttata 1260
tactagttac tcctaaagat gtgctgcctt cataagattt gggttgatgt attttactat 1320
tagttctaca agaagtagtg tggtgtaatt ttagaggata atggttcacc tctgcgtaaa 1380
ctgcaagtct taagcagaca tctggaatag agcttgacaa ataattagtg taactttttt 1440
ctttagttcc tcctggacaa cactgtaaat ataaagccta aagatgaagt ggcttcagga 1500
gtataaattc agctaattat ttctatatta ttatttttca aatgtcattt atcaggcata 1560
gctctgaaac attgatgatc taagaggtat tgatttctga atattcataa ttgtgttacc 1620
tgggtatgag agtgttggaa gctgaattct agccctagat tttggagtaa aaccccttca 1680
gcacttgacG gaaataccaa aaatgtctcc aaaaaattga tagttgcagg ttatcgcaag 1740
atgtcttaga gtagggttaa ggttctcagt gacacaagaa ttcagtatta agtacatagg 1800
tatttactat ggagtataat tctcacaatt gtattttcag ttttctgccc aatagagttt 1860
aaataactgt ataaatgatg actttaaaaa aatgtaagca acaagtccat gtcatagtca 1920
ataaaaacaa tcctgcagtt gggttttgta tctgatccct gcttggagtt ttagtttaaa 1980
gaatctatat gtagcaagga aaaggtgctt tttaatttta atccctttga tcaatatggc 2040
ttttttccaa attggctaat ggatcaaaat gaaacctgtt gatgtgaatt cagttattga 2100
acttgttact tgtttttgcc agaaatgtta ttaataaatg tcaatgtggg agataaaaaa 2160
aaaaaaaaaa aaactggg 2178



<210> 29 

<211> 1503 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature

<223> Incyte Identification No.: 2738270CB1

<400> 29

acgacctttt ggccaggtta gggagggggc gacgctgaga tgggggcggc ggcggcggaa 60
gcggatcgca ctctctttgt gggcaacctt gaaacgaaag tgaccgagga gctccttttc 120
gagcttttcc accaggctgg gccagtaata aaggtgaaaa ttccaaaaga taaggatggt 180
aaaccaaagc agtttgcgtt tgtgaatttc aaacatgaag tgtctgttcc ttatgcaatg 240
aatctactta atggaatcaa actttatgga aggcctatca aaattcaatt tagatcagga 300
agtagtcatg ccccacaaga tgtcagtttg tcatatcccc aacatcatgt tggaaattca 360
agccctacct ccacatctcc tagcagcagg tacgaaagga ctatggataa catgacttca 420
tcagcacaga taattcagag atctttctct tctccagaaa attttcagag acaagcagtg 480
atgaacagtg ctttgagaca aatgtcatat ggtggaaaat ttggttcttc acctctggat 540
caatcaggat tttcaccatc agttcaatca cacagtcata gtttcaatca gtcttcaagc 600
tcccagtggc gccaaggtac accatcatca cagcgtaaag tcagaatgaa ttcttatccc 660
tacctagcag atagacatta tagccgggaa cagcgttaca ctgatcatgg gtctgaccat 720
cattacagag gaaagagaga tgatttcttc tatgaagaca ggaatcatga tgactggagc 780
catgactatg ataacagaag agacagtagt agagatggaa aatggcgctc atctcgacac 840
taacacatgt taaaaggaca ttgtttttat agggtcattt taggcccttt gactaagttg 900
atatggaaat attttgttga aaaactgtac agagcagctt tacaagttgt cacattttct 960
ttataaattt ttttaaagct acagtttaat acaaaatgaa ttgcggtttt attacattaa 1020
taacctttca cctcagggtt ttatgaagag gaaagggttt tatgcaaaag aaagtgctac 1080
aattcctaat cattttagac actttaggag ggggtgaagt tgtatgataa agcagatatt 1140
ttaattattt gttatctttt tgtattgcaa gaaatttctt gctagtgaat caagaaaaca 1200
tccagattga cagtctaaaa tggctactgg tattttagtt aattcaaaaa tgaaactttt 1260
cagtgattca ctttactaac attctatttg agaaggctta ttggtaaagt ttggggataa 1320
aggcattgct taacttctta tataatttag gtataaattc tgtgacatgc tcttgagctt 1380
taccctagtt gaacatacat gtgtagattt acacatactg tttcattcta aaaattttag 1440
aattgttcat taaaacccca tttgaggtat aaggtcactc aggaaggtta aaatatctcc 1500
acc 1503



<210> 30 

<211> 2548 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature

<223> Incyte Identification No.: 2824412CB1



<400> 30 

tctgggagtc 

tgactcatct 

ggagttacag 

agacagtttg 

aggtaaagac 

tctccgagga 

attattaagc 

gtatgacata 

caaaatgcaa 

aggagatgta 

ccttgacgct 

gcagtcctgt 

gtctgatttt 

tagccctcag 

tattcttaaa 

ggcaacaatg 

actccttgca 

ccaacgtttt 

atttgaagct 

aaatcttcag 

gtctacattg 

atgatattaa 

aaatgtgctc 



cgaccaggca 
gatgaccgtt 
gcagttcaga 
tctgaacatg 
sg^gctttga 
gatagaaatg 
cgtattgcag 
aaatgtgctg 
gtcactatca 
acctcgggta 
ctggctgctc 
gtgattatca 
ccaagctggg 
agccctgggg 
ggtagtcctg 
actgaccagc 
ttccgccaga 
aacatccaca 

tgttaaaaaa 
aaaaatttat 
aaaatgtaca 
cgactagggg 



tgcctcctca 
atgtaatgac 
aaattgtttc 
agaagaacaa 
aaggagtttt 
tcaaccttgt 
aaaacctacc 
tatctgaagc 
cactgacatc 
tggtgaaaga 
tacgccacgc 
tacgcattct 
ctatggagtt 
atgcactgag 
gacttctgga 
agcgtgaaga 
tacacaaagt 
acaacaggaa 
aagacaaaaa 
acagatgccc 
caagaattta 
gtgttaggta 
gaaaacagta 



gcctcagggg 
aaaacatgcc 
tattactgaa 
gaacaaagag 
gcgagtggga 
tttgctgtgc 
caaacagctt 
ggcaataatt 
tccaattatt 
cccaccggac 
taagtggttc 
tcgagacctc 
actagtagag 
aagagttttt 
tccttgtgaa 
catcacatcc 
tctaggcatg 
acgaagaaga 
agattatgat 
atttgttggc 
aaggatttca 
ttatttgaat 
gttccgattt 



cctgcaccct 
accatttatc 
cgtgctttaa 
ggagatgata 
gtattggcaa 
tcagagaaac 
gctgttataa 
ttgaattcat 
cgagaagaga 
gtcttggaca 
caggctagag 
tgtcagcgag 
aaagcaatca 
gaatgcattt 
aaggatccct 
^gtgcacagt 
gatccattac 
gatagtgatg 
aacttttaaa 
tgtttttcat 
tggaagaacc 
ggaaagacac 
tttcccatta 



tacgtcgtcc 
caactgaaga 
aactcgtttc 
agaaagaggg 
aaggattact 
cttcaaagac 
gccctgagaa 
gtgtggaacc 
acatgaggga 
ggcaaaaatg 
ctaatggtct 
ttccaacttg 
gcagtgcttc 
cttcagggat 
ttgatacctt 
ttgcattgag 
cgcaaatgag 
gagttgatgg 
aagtgtctgt 
tcataataat 
aagtttttct 
ccaaaaaaaa 
tttttatttt 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
attttctggt
tcttattaaa 
tgatatttta 
tagatggtat 
gactgcagag 
tcgagtgctc 
ataaaggtgt 
ttagcagtat 
taattgataa 
taggtttaac 
ctttctgaat 
ctaaccaata 
aattatggtg 
aacacagatg 
tgaggttgat 
ggtgtggcca 
taatgaaaaa 
gttgtcacag 
attatatgtt 
aaaatcatat 



tgccctagct 
tcttcactgt 
atttagaggt 
cctttacccc 
tttgtttttc 
agaactgaat 
aattaagaca 
aaaagctaat 
taaaaatgca 
acgtagatct 
atttttttct 
ttttcaaaag 
acagttcctt 
atcttctcca 
aaagtattac 
tcttacctgc 
ttacatctta 
cactacagtg 
agaattgaca 
tcaaatttaa 



tcccccccta 
atttaatgca 
tttgtttgct 
ttcttaatat 
tatatgtgaa 
gtttctgcag 
gagttctgtt 
atatactata 
tgacattctt 
ttgtacagtt 
tgtgtataag 
aacttcatca 
attggtgtgt 
cacaccccat 
tgataaaaaa 
ttatgtctcc 
ctgttgatat 
aaatacacaa 
acataaacta 
aaaaaaaa 



tttttgtgtc 
ggatgtgtgc 
ttttgacact 
tttacagcag 
ggattacaac 
atcttgtggc 
aatctaatca 
tggtcttgca 
gtttttaata 
gactttttga 
tggaaagggc 
ttgtacaact 
gtgagattac 
cacccagata 
ttatctaagg 
tacacaaagc 
atgtatgctc
aaaatgaaat 
cttttgcttt 



ttttattaac 
ttcagttgct 
agttgtaagt 
tacgtttttt 
acaaaaagtt 
atttgtctct 
agtttgctgt 
acagttttaa 
gacttttaaa 
catagcaagg 
atttttcaca 
aacaacagta 
tctagcaact 
atttacagtt 
aaaaaaacag 
taaatattct 
tggtacacag 
tcatataatg 
gaaatgatgt 



tagtgcattg 
ctgtgtattt 
tactttgtta 
tgtaacgtga 
atcctgccat 
agtgtgatat 
tagttgtgca 
agcctctgca 
atcataattt 
ccaaaaataa 
tataagtggg 
actagccctt 
attacagtat 
ctgttaacag 
aaaattattt 
agcagtgatg 
atgtcatttt 
acttaaatgt 
atgcttcagt 



1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2548 



<210> 31 

<211> 811 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte Identification No.: 002690CB1 



<400> 31 

cgcatgcgtt ccctgaaatt gccgccaccg gctctacctt ccagtttcca gttccggcct 60
ccaaggggcg ggcagaagtt ggaaacatgc ggctgtcggt cgctgcagcg atctcccatg 120
gccgcgtatt tcgccgtatg ggcctcggtc ccgagtcccg catccatctg ttgcggaact 180
tgctcacagg gctggtgcgg cacgaacgca tcgaggcacc atgggcgcgt gtggacgaaa 240
tgaggggcta cgcggagaag ctcatcgact atgggaagct gggagacact aacgaacgag 300
ccatgcgcat ggctgacttc tggctcacag agaaggattt gatcccaaag ctgtttcaag 360
tactggcccc tcggtacaaa gatcaaactg ggggctacac aagaatgctg cagatcccaa 420
atcggagttt ggatcgggcc aagatggcag tgatcgagta taaagggaat tgcctcccac 480
ccctgcctct gcctcgcaga gacagccacc ttacactcct aaaccagctg ctgcagggtt 540
tgcggcagga cctcaggcaa agccaggaag caagcaacca cagctcccac acagctcaaa 600
caccagggat ttaactggat ctgaagagtc tgcagccctt aatcagtacc catgatcaca 660
ggcctttgga gcacttttac tctctgagaa gaactggagc tagagatgta aaatggacag 720
tcttgatggg gttgagaacc ttctggggag ccagatgacc ctctctttgc acaatagata 780
aaagtcttta tatgaatata aaaaaaaaaa a 811



<210> 32 

<211> 1457 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte Identification No.: 041108CB1
<400> 32

tacggaagct gggtcttctt gctgtgaggt cgcgttcccc agtgttacgg agggtccttg 60
aggcaggagt gaaaattggg tctgggggtt agtcctgggg tggaggtctg ggcacgccgg 120
gtcggacccc ctccatcttc ggttttgcac accccgcttt ccagcgcgga gtcgcggcgg 180
gtagggcggc gtcgcgtgcg tgacgtcatc cagcggcgcc tcgcgaggct ccagtggcct 240
tgacctcccg cggcgtggga ggctgcgcgg cgatgctgca gttcgtccgg gccggggcgc 300
gggcctggct tcggcctacc ggcagccagg gcctgagttc cctggcggaa gaggcagcgc 360
gtgcgaccga gaacccggag caggtggcga gcgagggtct cccggagccc gtgctgcgca 420
aagtcgagct cccggtaccc actcatcgac gcccagtgca ggcctgggtc gagtccttgc 480
ggggcttcga gcaggagcgc gtgggcctgg ccgacctgca ccccgatgtt ttcgccaccg 540
cgcccaggct ggacatactg caccaggttg ctatgtggca gaagaacttc aagagaatta 600
gctatgccaa gaccaagacg agagccgagg tgcggggcgg tggccggaag ccttggccgc 660
agaaaggcac tgggcgggcc cggcatggca gcatccgctc tccgctctgg cgaggaggag 720
gtgttgccca tggcccccgg ggccccacaa gttactacta catgctgccc atgaaggtgc 780
gggcgctggg tctcaaagtg gcactgaccg tcaagctggc ccaggacgac ctgcacatca 840
tggactccct agagctgccc accggagacc cacagtacct gacagagctg gcgcactacc 900
gccgctgggg ggactccgta ctcctcgtgg acttaacaca cgaggagatg ccacagagca 960
tcgtggaggc cacctctagg cttaagacct tcaacttgat cccggctgtt ggcctaaatg 1020
tgcacagcat gctcaagcac cagacgctgg tcctgacgct gcccaccgtc gccttcctgg 1080
aggacaagct gctctggcag gactcacgtt acagacccct ctaccccttc agcctgccct 1140
acagcgactt cccccgaccc ctaccccacg Gtacccaggg cccagcggcc accccgtacc 1200
actgttgatg tgaagcacct cttctgagcc aggccgagcc cctggccgac ttgggagcct 1260
caggcccacg cccacccttc gaggaaggtg tcacctggac cccttcattc cacggaggaa 1320
gctgaggcca cagggagcgg ccatcgccat tgggaagggg cgactccacg gagagcccag 1380
acgggcttct gcatccattc cctctttttg tttttaaaat aaattgtatt tttgaatcaa 1440
ggaggaaaaa aaaaaaa 1457



<210> 33 

<211> 1357 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte Identification No.: 869138CB1



<400> 33 

ctggtggcgt tcaagatgtc gaccaagaat ttccgagtca gtgacgggga ctggatttgc 60
cctgacaaaa aatgtggaaa tgtaaacttt gctagaagaa ccagctgtaa tcgatgtggt 120
cgggagaaaa caactgaggc caagatgatg aaagctgggg gcactgaaat aggaaagaca 180
cttgcagaaa agagccgagg cctatttagt gctaatgact ggcaatgtaa aacttgcagc 240
aatgtgaatt gggccagaag atcagagtgt aatatgtgta atactccaaa gtatgctaaa 300
ttagaagaaa gaacaggata tggtggtggt tttaatgaaa gagaaaatgt tgaatatata 360
gaaagagaag aatctgatgg tgaatatgat gagtttggac gtaaaaagaa aaaatacaga 420
gggaaagcag ttggtcctgc atctatatta aaggaagttg aagataaaga atcagaggga 480
gaagaagagg atgaggatga agatctttct aaatataagt tagatgagga tgaggatgaa 540
gatgacgctg atctctcaaa atataatctt gatgccagtg aagaagaaga tagtaataaa 600
aagaaatcta atagacgaag tcgctcaaag tctcgatctt cacattcacg atcttcatca 660
cgctcatcct ccccctcaag ttcaaggtct aggtccaggt cccgttcaag aagttcttcc 720
agttcgcagt caagatctcg ttccagttcc agagaacgtt cgagatctcg tgggtcgaaa 780
tcaagatcca gctccaggtc ccacaggggc tcttcttccc cacgaaaaag atcttattca 840
agttcatcat cttctcctga gaggaacaga aagagaagtc gttctagatc ttcttcatct 900
ggtgatcgca aaaaaagacg aacaagatca cggtcacccg aaagacgcca caggtcatca 960
tctggatcat cccattctgg ttcccgttca agttcaaaaa agaaataatg tattaaaatt 1020
tacatcttaa aaaaatccag tacagtgcat gaagcatatt tttaaagaag ttggtgtctt 1080
acttggtcag aagtgctaaa tctgctagta gaggtgcatg cctttcattg cttttcaaaa 1140
caatacagct gtgtttattt gtgaagttaa aagtaaatag cattttaagc cataatgtcc 1200
caaaatagat gttctgtcat tcattattta caaccatttg cttcatttaa aaccatttca 1260
gctataacaa agtactttgc ttcctaattt aaacccattt ttgtcatttc caaatacatc 1320
ctgtccattg gctaagacag gattacctag gcttgct 1357



<210> 34 

<211> 1326 

<212> DNA 

<213> Homo sapiens 



<221> unsure
<222> 1313

<223> a or t or g or c, unknown, or other
<220>

<221> misc_feature

<223> Incyte Identification No.: 934406CB1



<400> 34 

ttttcttcgg ggactatcct tgtctgatca ggcgggaaag acggtgccgc ccgacaatgc 60
gcggaggtag gagggggaag tggaggcggg agtgaagtct cgcgagaaga gtcggttgcc 120
gtagcagagc cctctagctg tgtgtgtctg sggctcggcc gcctgagccg cggacggttt 180
gctgagcccg ttagtgcgcc cggccgagac acgccgccgc catgtcccgc tacctgcgtc 240
cccccaacac gtctctgttc gtcaggaacg tggccgacga caccaggtct gaagacttgc 300
ggcgtgaatt tggtcgttat ggtcctatag ttgatgtgta tgttccactt gatttctaca 360
ctcgccgtcc aagaggattt gcttatgttc aatttgagga tgttcgtgat gctgaagacg 420
ctttacataa tttggacaga aagtggattt gtggacggca gattgaaata cagtttgccc 480
^ggggg'^tcg aaagacacca aatcagatga aagccaagga agggaggaat gtgtacagtt 540
cttcacgcta tgatgattat gacagataca gacgttctag aagccgaagt tatgaaagga 600
ggagatcaag aagtcggtct tttgattaca actatagaag atcgtatagt cctagaaaca 660
gtagaccgac tggaagacca cggcgtagca gaagccattc cgacaatgat agaccaaact 720
gcagctggaa tacccagtac agttctgctt actacacttc aagaaagatc tgaaagcgga 780
aaaagaacca aagaagggca gttcaagcga ccaaagggtg ggtggaaggt gctgcagtat 840
gaatactgta cgaatatttt gactctggtc tgaaaagata aaagaatgtt atcgaaaact 900
acatggaata attgaagtcc cttcaagttt gaaagtaagc attttaggac aaataaaagg 960
aaattcaact ttgtacttgt ggaaactaat ccctaaatat gaataggttt atattgattc 1020
atgggtaaca ggtccataat aaattattgg aaactaggat gtctgaatat caaggaagac 1080
agccatagtc tcttacagtg cctctgttgg tctgtctcaa actgaattgg gtgggaaaag 1140
gtatggtcca atataaaagt tccatttttg ccattattgg gcaaatcttg cctttgttta 1200
ttttggtgcc agtgttttct gcttaatcat ttgctttgtt ggcatctgtg tttatttact 1260
tgtacaccac atgcagttta catctgtctt aactactcct tcccaggtaa ttnccattat 1320
attgac 1326



<210> 35 

<211> 3301 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte Identification No.: 1315083CB1



<400> 35 

gaagggaagt aacgtcagcc tgagaactga gtagctgtac tgtgtggcgc cttattctag 60
gcacttgttg ggcagaatgt cacacctgcc gatgaaactc ctgcgtaaga agatcgagaa 120
gcggaacctc aaattgcggc agcggaacct aaagtttcag ggggcctcaa atctgaccct 180
atcggaaact caaaatggag atgtatctga agaaacaatg ggaagtagaa aggttaaaaa 240
atcaaaacaa aagcccatga atgtgggctt atcagaaact caaaatggag gcatgtctca 300
agaagcagtg ggaaatataa aagttacaaa gtctccccag aaatccactg tattaaccaa 360
tggagaagca gcaatgcagt cttccaattc agaatcaaaa aagaaaaaga agaaaaagag 420
aaaaatggtg aatgatgctg agcctgatac gaaaaaagca aaaactgaaa acaaagggaa 480
atctgaagaa gaaagtgccg agactactaa agaaacagaa aataatgtgg agaagccaga 540
taatgatgaa gatgagagtg aggtgcccag tctgcccctg ggactgacag gagcttttga 600
ggatacttcg tttgcttctc tatgtaatct tgtcaatgaa aacactctga aggcaataaa 660
agaaatgggt tttacaaaca tgactgaaat tcagcataaa agtatcagac cacttctgga 720
aggcagggat cttctagcag ctgcaaaaac aggcagtggt aaaaccctgg cttttctcat 780
ccctgcagtt gaactcattg ttaagttaag gttcatgccc aggaatggaa caggagtcct 840
tattctctca cctactagag aactagccat gcaaaccttt ggtgttctta aggagctgat 900
gactcaccac gtgcatacct atggcttgat ^^tgggtggc agtaacagat ctgctgaagc 960
acagaaactt ggtaatggga tcaacatcat tgtggccaca ccaggccgtc tgctggacca 1020
tatgcagaat accccaggat ttatgtataa aaacctgcag tgtctggtta ttgatgaagc 1080
tgatcgtatc ttggatgtgg ggtttgaaga ggaattaaag caaattatta aacttttgcc 1140
aacacgtaga cagactatgc tcttttctgc cacccaaact cgaaaagttg aagacctggc 1200
aaggatttct ctgaaaaagg agccattgta tgttggcgtt gatgatgata aagcgaatgc 1260
aacagtggat ggtcttgaac agggatatgt tgtttgtcct tctgaaaaga gattccttct 1320
gctctttaca ttccttaaga agaaccgaaa gaagaagctt atggtcttct tttcatcttg 1380
tatgtctgtg aaataccact atgagttgct gaactacatt gatttgcccg tcttggccat 1440
tcatggaaag caaaagcaaa ataagcgtac aaccacattc ttccagttct gcaatgcaga 1500
ttcgggaaca ctattgtgta cggatgtggc agcgagagga ctagacattc ctgaagtcga 1560
ctggattgtt cagtatgacc ctccggatga ccctaaggaa tatattcatc gtgtgggtag 1620
aacagccaga ggcctaaatg ggagagggca tgccttgctc attttgcgcc cagaagaatt 1680
gggttttctt cgttacttga aacaatccaa ggttccatta agtgaatttg acttttcctg 1740
gtctaaaatt tctgacattc agtctcagct tgagaaattg attgaaaaga attactttct 1800
tcataagtca gcccaggaag catataagtc atacatacga gcctatgatt cccattctct 1860
gaaacagatc tttaatgtta ataacctaaa tttgcctcag gttgctctgt catttggttt 1920
caaggtgcct cccttcgttg atctgaacgt caacagtaat gaaggcaagc agaaaaagcg 1980
aggaggtggt ggtggatttg gctaccagaa aaccaagaaa gttgagaaat ccaaaatctt 2040
taaacacatt agcaagaaat catctgacag caggcagttc tctcactgaa cacatgcctt 2100
cctttcatct tgaataactt tgtcctaaaa tgaatttttt ttccccttga tttaacagga 2160
tttttgtaga ctttagaatt tggacttacc taacaagagt ataaattgac ttgggttgca 2220
agcactgagc actgttactt ctatcacgtc tctcttttat ttctgggata taaaacaggc 2280
tttaagtttc ttggttgccc aagggcagag caaggaatat ctggtgtttc ttgtgatgat 2340
aatattttaa ttttaaatat ccctccctca tacaagtgta tgttaccatt ttaatataat 2400
tctttttgta cctttccttc ttgttttgtg aagatttttg tggcatggat tgctgtgctc 2460
actgctgtaa aaggtgacct agtgtactgg gcagctggtg gcggtgcaga aaagagtctc 2520
aggttatttt ttgtttttag ttatttcttg gaccttgaca gtatctaatg actcctcctg 2580
aaaatgctgc agtataaaag agcaaagagc tttgggaaat acctaagaag caccttaaga 2640
ttagggtggc attgctttta tagattcttg attttaaagc aacaggcctt tctcaggtgt 2700
tgcatttttt ggagcaaaaa ctatgggttg taatttgaat aaagtgtcac taagcagtta 2760
taacgtttga tggctggggg gtaggaagag gatggaattg agatgtttga gcctcattta 2820
catcaataga ggtgtaatgt actgcatttc ttcatttggt aacataacaa agactttcat 2880
acaaagaacg atgatgctcc tcattaagat ttgtttaatt caaggtggtt tggatttggt 2940
aagcctttgc actctgtaga gtacttagaa gacaagggca acttacttgg agttagagcc 3000
aagctgtcag acggtgccca gcacacatta atgttagctt ctttctgaga aaaaaatacc 3060
tcttccaggc cctgaaacaa aaaatacatt tgctgtgaag attgaaaatg aacaaagtta 3120
gaaaaaaaaa cagcaaaatc agtgatttag tcagatgagt ttttcgttgt aggagcactt 3180
gatttctagt gtgttttgta cagtatataa ctacaagata gtacattttg tagcagttca 3240
aagccaaagt tgctagcatc attttgctgt tgtgccagtt aatcatagga tcccattaag 3300
g 3301



<210> 36 
<211> 1703
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature

<223> Incyte Identification No.: 1444908CB1 



<400> 36 

gcttcccctc cgtcatccat cgctcggcgg tcttctcttc tccatgggtc tgctctgcgc 60
gttccatcga gtttctcggc catcgcgcgc ctgcgccatt gggctgtcag tcagaggcgg 120
cgtggagatc gctgggagcg gttgcggcgt gcggggagct gagttatagc tgtgacttct 180
gccctgccag gccgcacaca agctggctga cccggtttgt aaaaatggaa tttcaagcag 240
tagtgatggc agtaggtgga ggatctcgga tgacagacct aacttccagc attcccaaac 300
ctctgcttcc agttgggaac aaacctttaa tttggtaccc attgaacctg cttgagcgtg 360
ttggatttga agaagtcatt gtggttacaa ccagggatgt tcaaaaggct ctatgtgcag 420
aattcaagat gaaaatgaag ccagatattg tgtgtattcc tgatgatgct gacatgggaa 480
ctgcagattc tttgcgctac atatatccaa aacttaagac agatgtgctg gtgctgagct 540
gtgatctgat aacagacgtt gccttacatg aggttgtgga cctgtttaga gcttatgatg 600
catcacttgc tatgttgatg agaaaaggcc aagatagcat agaacctgtt cccggtcaaa 660
^9'g^99^3.aaa aaaagcagtg gagcagcgtg acttcattgg agtggacagc acaggaaaga 720
ggctgctctt catggctaat gaagcagact tggatgaaga gctggtcatt aagggatcca 780
tcctacagaa gcatcctaga atacgtttcc acacgggtct tgtggatgcc cacctctact 840
gtttgaaaaa atacatcgtg gatttcctaa tggaaaatgg gtcaataact tctatccgga 900
gtgaactgat tccatattta gtgagaaaac agttttcctc agcttcctca caacagggac 960
aagaagaaaa agaggaggat ctaaagaaaa aggagctgaa gtccttagat atctacagtt 1020
ttataaaaga agccaataca ctgaacctgg ctccctatga tgcctgctgg aatgcctgtc 1080
gaggagacag gtgggaagac ttgtccagat cacaggtgcg ctgctatgtc cacatcatga 1140
aagaggggct ctgctctcga gtgagcacac tgggactcta catggaagca aacagacagg 1200
tgcccaaatt gctgtctgct ctctgtccag aagaaccacc agtccattcg tcagcccaga 1260
ttgtcagcaa acacctggtt ggagttgaca gcctcattgg gccagagaca cagattggag 1320
agaagtcatc cattaagcgc tcagtcattg gctcatcctg tctcataaaa gatagagtga 1380
ctattaccaa ttgccttctc atgaactcag tcactgtgga ggaaggaagc aatatccaag 1440
gcagtgtcat ctgcaacaat gctgtgatcg agaagggtgc agacatcaag gactgcttga 1500
ttggaagtgg ccagaggatt gaagccaaag ctaaacgagt gaatgaggtg atcgtgggga 1560
atgaccagct catggagatc tgagttctga gcaagtcaga ctccttcctt ttggcctcca 1620
aagccacaga tgttggccgg cccacctgtt taactctgta tttatttccc aataaagaag 1680
ggcttccaaa ggcaaaaaaa aaa 1703



<210> 37 
<211> 2536 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte Identification No.: 1557481CB1
<400> 37 

gacttccggg gcggcggttg catcagattc taggaagtgt ctgtagccgc agctgcgggt 60
ccgggattcc cagccatggc agattcctcc gggcagcagg gcaaaggccg gcgtgtgcag 120
ccccagtggt cccctcctgc tgggacccag ccatgcagac tccaccttta caacagcctc 180
accaggaaca aggaagtgtt catacctcaa gatgggaaaa aggtgacgtg gtattgctgt 240
gggccaaccg tctatgacgc atctcacatg gggcacgcca ggtcctacat ctcttttgat 300
atcttgagaa gagtgttgaa ggattacttc aaatttgatg tcttttattg catgaacatt 360
acggatattg atgacaagat catcaagagg gcccggcaga accacctgtt cgagcagtat 420
cgggagaaga ggcctgaagc ggcacagctc ttggaggatg ttcaggccgc cctgaagcca 480
ttttcagtaa aattaaatga gaccacggat cccgataaaa agcagatgct cgaacggatt 540
cagcacgcag tgcagcttgc cacagagcca cttgagaaag ctgtgcagtc cagactcacg 600
ggagaggaag tcaacagctg tgtggaggtg ttgctggaag aagccaagga tttgctctct 660
gactggctgg attctacact tggctgtgat gtcactgaca attccatctt ctccaagctg 720
cccaagttct gggaggggga cttccacaga gacatggaag ctctgaatgt tctccctcca 780
gatgtcttaa cccgggttag tgagtatgtg ccagaaattg tgaactttgt ccagaagatt 840
gtggacaacg gttacggcta tgtctccaat gggtctgtct actttgatac agcgaagttt 900
gcttctagcg agaagcactc ctatgggaag ctggtgcctg aggccgttgg agatcagaaa 960
gcccttcaag aaggggaagg tgacctgagc atctctgcag accgcctgag tgagaagcgc 1020
tctcccaacg actttgcctt atggaaggcc tctaagcccg gagaaccgtc ctggccgtgc 1080
ccttggggaa agggtcgtcc gggctggcat atcgagtgct cggccatggc aggcaccctc 1140
ctaggggctt cgatggacat tcacggaggt gggttcgacc tccggttccc ccaccatgac 1200
aatgagctgg cacagtcgga ggcctacttt gaaaacgact gctgggtcag gtacttcctg 1260
cacacaggcc acctgaccat tgcaggctgc aaaatgtcaa agtcactaaa aaacttcatc 1320
accattaaag atgccttgaa aaagcactca gcacggcagt tgcggctggc cttcctcatg 1380
cactcgtgga aggacaccct ggactactcc agcaacacca tggagtcagc gcttcaatat 1440
gagaagttct tgaatgagtt tttcttaaat gtgaaagata tccttcgcgc tcctgttgac 1500
atcactggtc agtttgagaa gtggggagaa gaagaagcag aactgaataa gaacttttat 1560
gacaagaaga cagcaattca caaagccctc tgtgacaatg ttgacacccg caccgtcatg 1620
gaagagatgc gggccttggt cagtcagtgc aacctctata tggcagcccg gaaagccgtg 1680
aggaagaggc ccaaccaggc tctgctggag aacatcgccc tgtacctcac ccatatgctg 1740
aagatctttg gggccgtaga agaggacagc tccctgggat tcccggtcgg agggcctgga 1800
accagcctca gtctcgaggc cacagtcatg ccctaccttc aggtgttatc agaattccga 1860
gaaggagtgc ggaagattgc ccgagagcaa aaagtccctg agattctgca gctcagcgat 1920
gccctgcggg acaacatcct gcccgagctt ggggtgcggt ttgaagacca cgaaggactg 1980
cccacagtgg tgaaactggt agacagaaac accttattaa aagagagaga agaaaagaga 2040
cgggttgaag aggagaagag gaagaagaaa gaggaggcgg cccggaggaa acaggaacaa 2100
gaagcagcaa agctggccaa gatgaagatt ccccccagtg agatgttctt gtcagaaacc 2160
gacaaatact ccaagtttga tgaaaatggt ctgcccacac atgacatgga gggcaaagag 2220
ctcagcaaag ggcaagccaa gaagctgaag aagctcttcg aggctcagga gaagctctac 2280
aaggaatatc tgcagatggc ccagaatgga agcttccagt gagggggcac aggactgact 2340
ttttaaacca ttgtggacta gtggctgctg tctgcctcag tgacaatgtc ccagcgctcc 2400
tatcatgttt acagtcaccc ttgggtccta aattaagagt tgtgttcatg taggttcgtg 2460
tcgtcgttgg ctctgagaca ttgataataa atttttctca acagtgaaaa aaaaaaaaaa 2520
gaaaaaaaaa aaaaaa 2536



<210> 38 

<211> 1350 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature

<223> Incyte Identification No.: 1747456CB1
<400> 38 

cggaggagcc cgggcggggg ggaggaggag ggggaggagg gagcggagat ctcggggctc 60
ggagccggcc gccgctccgc tccgatcgct gtggggcttg gttttttggg ggtggggggg 120
cgggggggct cagatatgga ggcaaatggg agccaaggca cctcgggcag cgccaacgac 180
tcccagcacg accccggtaa aatgtttatc ggtggactga gctggcagac ctcaccagat 240
agccttagag actattttag caaatttgga gaaattagag aatgtatggt catgagagat 300
cccactacga aacgctccag aggcttcggt ttcgtcacgt tcgcagaccc agcaagtgta 360
gataaagtat taggtcagcc ccaccatgag ttagattcca agacgattga ccccaaagtt 420
gcatttcctc gtcgagcgca acccaagatg gtcacaagaa caaagaaaat atttgtaggc 480
gggttatctg cgaacacagt agtggaagat gtaaagcaat atttcgagca gtttggcaag 540
gtggaagatg caatgctgat gtttgataaa actaccaaca ggcacagagg gtttggcttt 600
gtcacttttg agaatgaaga tgttgtggag aaagtctgtg agattcattt ccatgaaatc 660
aataataaaa tggtagaatg taagaaagct cagccgaaag aagtcatgtt cccacctggg 720
acaagaggcc gggcccgggg actgccttac accatggacg cgttcatgct tggcatgggg 780
atgctgggat atcccaactt cgtggcgacc tatggccgtg gctaccccgg atttgctcca 840
agctatggct atcagttccc aggcttccca gcagcggctt atggaccagt ggcagcagcg 900
gcggtggcgg cagcaagagg atcaggctcc aacccggcgc ggcccggagg cttcccgggg 960
gccaacagcc caggacctgt cgccgatctc tacggccctg ccagccagga ctccggagtg 1020
gggaattaca taagtgcggc cagcccacag ccgggctcgg gcttcggcca cggcatagct 1080
ggacctttga ttgcaacggc ctttacaaat ggataccatt gagcaggtgc tttcgttgcc 1140
atctcactct gagagcatac ctggatgtcc aggcaagact gggcgaagtt tctgagtggc 1200
Gctttgttta ggtgatgtcc tcagacctgg acccccacca gcctcactcc ccatcccaac 1260
cagagatggc tcacttcgga tcgagggttg actacatctc atcatctcac gaatctgctg 1320
taatataaga caacagcttt taaatgtgta 1350



<210> 39 
<211> 2190 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature

<223> Incyte Identification No.: 1748626CB1
<400> 39 

cctgcgggag ccccgtgccc gtcacgccgc cgcagccttc gccctggatg ccgctgctgc 60 
cgccgccgtg ggactgtccc gggagcgggc cctggactac tacgggctgt acgacgaccg 120 
tgggcgcccc tatggctacc cagctgtgtg tgaggaggac ctgatgcccg aggatgacca 180 
gcgggccacg cgcaacctct tcattggtaa cctggaccac agcgtatctg aggtggagct 240 
gcgaagggcc ttcgagaaat atggcatcat cgaggaggtg gtcatcaaga ggcctgcccg 300
tggccagggc ggtgcctatg ccttcctcaa gttccagaac ctggacatgg cccatagggc 360
taaggtggcc atgtcgggcc gagtgattgg tcgcaacccc attaagatag gctatggcaa 420
ggccaacccc accactcgtc tctgggtggg tggcctggga cctaacacgt cactggcggc 480 
tctggcccga gagtttgacc gctttgggag cattcggacc attgatcacg tcaaaggaga 540 
tagctttgcc tatattcagt acgagagctt ggacgcagcc caggccgcct gtgctaaaat 600 
g^gggg'tttt cccttgggtg gaccagaccg caggctccgc gtggattttg ccaaagcaga 660 
ggagactcgg tacccccagc agtaccagcc ctcgccactc cctgtgcatt atgagctgct 720 
cacagatgga tacacccggc accgcaacct ggacgccgac ctggtgcggg acaggacgcc 780 
cccacacctt ctgtactcag accgagaccg gacttttttg gaaggggact ggaccagccc 840 
cagtaaaagc tctgaccgcc gaaacagcct tgagggctac agtcgctcag tgcgcagccg 900 
gagtggtgag cgttgggggg cagatggaga ccgtggtttg cccaagccct gggaagagag 960
gcggaaacgg agaagccttt ccagtgaccg tgggaggaca acccattcac catatgagga 1020 
acggagtagg accaagggca gtgggcagca gtcagagcgg ggctccgacc gcacccctga 1080
gcgcagccgc aaggagaacc actccagtga agggaccaag gagtccagca gcaactccct 1140
cagcaacagc agacatgggg ctgaggaacg gggccaccac caccaccacc acgaggctgc 1200 
agactcttcc cacgggaaga aggcaagaga cagcgagcgc aatcaccgga ccacagaggc 1260
cgagcccaag cctctggaag agccaaaaca cgagaccaaa aagctgaaga atctttcaga 1320 
gtacgctcag acactacagc tgggttggaa tgggcttctg gtgttgaaaa acagctgctt 1380 
ccccacgtct atgcatatcc tagaggggga ccagggggtg atcagcagtc tcctcaaaga 1440
ccacacttct gggagcaagc tgacccagct gaagatcgcc cagcgccttc gactggacca 1500 
gcccaagctt gacgaggtca cacgacgcat caagcagggg agccccaacg gctatgcggt 1560 
cctcttagcc acccaggcaa cccccagtgg gcttggcact gaggggatgc ccacagtaga 1620 
gcccggtctg cagaggcggc ttctcaggaa cctggtctcc tacttgaaac agaagcaggc 1680 
cgcaggggtg atcagcttgc cagtgggggg gtccaagggc agagacggca caggcatgct 1740
ctacgccttc ccaccctgcg acttttccca gcagtacctc cagtcagcac taaggacatt 1800 
gggcaagcta gaagaagaac acatggtgat agtcatcgtc agagacactg cctagcccaa 1860
gcctgtcttt cccagcgtca tgtttgtgtc acaaaagcag ttattttaaa atctgatccc 1920

ctctctaccc taccactttg gtttgaatta tctcctgggt tattttggtt catttgggtg 1980 

gggatcaaag tcctgtccac caccaaaact aagttcttag attttggggg attttttttt 2040

ttaaacgatg agaagggaat ccggttatgt tgatttctag tgtacaagat actgtctgct 2100 

gtggttctgt atttttttat tttttgacca actgtatgga aagttgtcag taaaaccttt 2160 

gacagaggat ggatttttaa aaaaaaaaaa 2190 



<210> 40 
<211> 680 
<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature

<223> Incyte Identification No.: 1879135CB1



<400> 40 

gtctttggct gtgtggtctt aaatgtgttt ctaatgtgtg tgtcaaataa ttacctgtta 60 

aacagactgc caatctggct gaagccaatg cttctgaaga agataaaatt aaagcaatga 120

tgtcgcaatc tggccatgaa tacgacccaa tcaattacat gaagaaacct ctaggtccac 180 

cacctccatc ttacacgtgt ttccgttgtg gtaaacctgg acattatatt aagaattgcc 240 

caacaaatgg ggataaaaac tttgaatctg gtcctaggat taaaaagagc actggaattc 300 

ccagaagttt catgatggaa gtgaaagatc ctaatatgaa aggtgcaatg cttaccaaca 360 

ctggaaaata tgcaatacca actatagatg cagaagcata tgcaattggg aagaaagaga 420

aacctccctt cttaccagag gagccatctt cttcctcaga agaagatgat cctatcccag 480 

atgaattgtt gtgtctcatc tgcaaggata ttatgactga tgctgttgtg attccctgct 540 

gtggaaacag ttactgtgat gaatgtaaga agtgctgaat cttggaagat gtatatttta 600 

gaatatttgt atttacttgg aatggctctt cccaacctca tatgttttaa taataaaata 660 
aataatgttg aaaaaaaaaa 680 



<210> 41 
<211> 1150 
<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature

<223> Incyte Identification No.: 2073417CB1



<400> 41 

g^cggaagtg cggtgttgag cgccggcggc tcgcgcccac gctgggccgg gagtcgaaat 60 
gcttcccggt gccgggagtg agcgatgagc tggcttctgt tcctggccca cagagtcgcc 120 
ttggccgcct tgccctgccg ccgcggctct cgcgggttcg ggatgttcta tgccgtgagg 180 
agaccggggt ctttctgacc tggaatgagt gcagagcaca ggtggaccgg 240 
tttcctgctg ccagatttaa gaagtttgcc acagaggatg aggcctgggc ctttgtcagg 300 
aaatctgcaa gcccggaagt ttcagaaggg catgaaaatc aacatggaca agaatcggag 360 
gcgaaagcca gcaagcgact ccgtgagcca ctggatggag atggacatga aagcgcagag 420 
ccgtatgcaa agcacatgaa gccgagcatg gagccggcgc ctccagttag cagagacacg 480 
ttttcctaca tgggagactt cgtcgtcgtc tacactgatg gctgctgctc cagtaatggg 540 
cgtagaaggc cgcgagcagg aatcggcgtt tactgggggc caggccatcc tttaaatgta 600 
ggcattagac ttcctgggcg gcagacaaac caaagagcgg aaattcatgc agcctgcaaa 660 
gccattgaac aagcaaagac tcaaaacatc aataaactgg ttctgtatac agacagtatg 720 
tttacgataa atggtataac taactgggtt caaggttgga agaaaaatgg gtggaagaca 780 
agtgcaggga aagaggtgat caacaaagag gactttgtgg cactggagag gcttacccag 840 
gggatggaca ttcagtggat gcatgttcct ggtcattcgg gatttatagg caatgaagaa 900 
gctgacagat tagccagaga aggagctaaa caatcggaag actgagccat gtgactttag 960 

tccttgggag aacttgagcc agcggctgtc ttgctgcctg tacttactgg tgtggaaaat 1020 

agcctgcagg taggaccatt gcagtgatgg gcagatgcgt ctttcacacg gaatcaggca 1080 

cagtggcctt ctgtgacatg tgtttataaa aaatggttaa gtatataata aattgaacat 1140 
ctttgagatt 1150 



<210> 42 

<211> 2545 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte Identification No. : 2129080CB1 



<400> 42 

ggcagggagc ggagacggag gaggaggagg gagaggctga atgttggctc gggagacgta 60 
cgaggaggac cgggagtacg agagccaggc caagcgtctc aagaccgagg agggggagat 120 
cgactactcg gccgaggaag gcgagaaccg ccgggaagcg acgccccggg gcgggggcga 180 
tggcggcggc ggcggccgga gcttctctca gccggaggca ggtggaagtc atcataaagt 240 
ttctgtttca cccgtcgtcc atgttcgagg actctgtgaa tctgtggtgg aagcagacct 300 
cgtggaagcg ctggaaaaat ttgggacaat atgctatgtg atgatgatgc catttaaacg 360 
acaggctcta gtggaatttg aaaacataga tagtgccaaa gaatgtgtga catttgctgc 420 
agatgaaccc gtgtacattg ctggtcaaca ggcttttttc aactattcta caagcaaaag 480 
gatcactcgg ccaggaaata ctgatgatcc atcaggaggc aacaaagttc ttctgctctc 540 
aattcagaat ccgctttatc caattacagt ggatgtttta tatactgtat gcaaccctgt 600 
tggcaaagtg caacgtattg ttatattcaa gagaaatggg atacaagcaa tggttgagtt 660 
tgaatcagtc ctttgtgccc agaaagctaa agcagcactc aatggagctg atatatatgc 720 
tggatgttgc acactaaaaa ttgaatatgc acggccaact cgtctaaatg ttattaggaa 780 
tgacaatgac agttgggact acactaaacc atatttggga agacgagata gaggaaaggg 840 
tcgccagaga caagccattt tggg3.gaaca cccttcttcg tttagacatg atggctatgg 900 
atcccatggt ccattattgc ctttaccaag tcgttacaga atgggctctc gagatacacc 960 
tgaacttgtt gcttatccat taccacaggc ttcttcctct tacatgcatg gaggaaatcc 1020 
ctctggttca gttgtaatgg ttagtggatt acatcaacta aaaatgaatt gttcaagagt 1080 
cttcaacctg ttctgcttat atggaaatat tgagaaggta aaatttatga agaccattcc 1140 
tggtacagca ctggtagaaa tgggtgatga gtatgctgta gaaagagctg tcacacacct 1200 
taataatgtc aaattatttg ggaaaagact taatgtttgc gtgtctaaac aacattcagt 1260 
tgttccaagt caaatatttg agctggagga tggtaccagc agctacaaag attttgcaat 1320 
gagcaaaaat aatcgcttta caagtgctgg ccaagcatct aagaatataa tccagccacc 1380 
ctcctgtgtt ttgcattatt ataatgttcc attgtgtgtc acagaagaga ccttcacaaa 1440 
gttgtgtaat gaccatgaag ttcttacatt catcaaatat aaagtgtttg atgcaaaacc 1500 
ttcagccaaa acactttctg ggctattaga atgggagtgc aaaactgatg cagtagaagc 1560 
ccttacggca ctgaatcact atcagataag agtgccgaat ggttccaatc cctatacatt 1620 
gaagctttgc ttttctacat catcccattt ataagaagag aagagcatgt tagaatttat 1680 
gttcaccttt attacaattt caaagctaca cttcattaaa aaaaaatcta aaatggttga 1740 
tctcatgttg ccttgcttac tttaagatcc tgttctgtaa taaacatatt ttgccttgag 1800 
taaatttgtt gtaagcttaa atattgaatt gttttcattt taagatagaa tatcataatg 1860 
tagactatct acagcttcat tgtagattat acagatatat gatttctaac cttattactg 1920 
gaatttttct tccacagtaa aaatatattt gcattcttaa tgctaattat ctgcaagtat 1980 
tttttcattg tgtaagagat taatgcaggt gaaagtattg cattttaata tagaattcct 2040 
attatatgtt tagatgttta agtatgttgc agttactcat attaaacata acttgtatat 2100 
ttattatttt aatgaagttt gagaataacg ttacatatgt tgaattttaa gtactacaga 2160 
tttaactgat tttatatttc tgaaaggcta acagacatgg atacacgtgt acagtatgca 2220 
ttcaaactta tttaaattgg tgtatttttt tttaagtcac tgtccatttg tattgacatg 2280 
cctctgtttc tagtccagtt tggagatttt ataaagttat aacaatgagt taatgtgttc 2340 
attttcattt gttgcatgtg acttaaatac agctagtatt tggcattgag attttaatag 2400 
aggttataat taacagttcc tcttatagat aataatctgg gatccatggg tgggcttcag 2460 
agaatctgtg taccccctaa aattgtatgt agaattttgg atatttacat ttttattttt 2520 
taactcttgg atccatcagt aatat 2545 



<210> 43 

<211> 907 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte Identification No. : 2472867CB1 



<400> 43 

caagcttggc gtttgtttgg tggggtcaca cgcgggttca acatgcgtat cgaaaagtgt 60 
tatttctgtt cggggcccat ctatcctgga cacggcatga tgttcgtccg caacgattgc 120 
aaggtgttca gattttgcaa atctaaatgt cataaaaact ttaaaaagaa gcgcaatcct 180 
cgcaaagtta ggtggaccaa agcattccgg aaagcagctg gtaaagagct tacagtggat 240 
aattcatttg aatttgaaaa acgtagaaat gaacctatca aataccagcg agagctatgg 300 
aataaaacta ttgatgcgat gaagagagtt gaagaaatca aacagaagcg ccaagctaaa 360 
tttataatga acagattgaa gaaaaataaa gagctacaga aagttcagga tatcaaagaa 420 
gtcaagcaaa acatccatct tatccgagcc cctcttgcag gcaaagggaa acagttggaa 480 
gagaaaatgg tacagcagtt acaagaggat gtggacatgg aagatgctcc ttaaaaatct 540 
ctgtaaccat ttcttttatg tacatttgaa aatgcccttt ggatacttgg aactgctaaa 600 
ttattttatt ttttacataa ggtcacttaa atgaaaagcg attaaaagac atctttcctg 660 
cattgccatc tacataatat cagatattac ggatgttaga ttgcatctca gtgttaaatc 720 
tttactgata gatgtactta agtaaatcat gaaaattcta cttgtaacta tagaagtgaa 780 
ttgtggacgt aaaatggttg tgctatttgg ataatggcac taggcagcat ttgtatagta 840 
actaatggca aaaattcatg gctagtgatg tataaaataa aatattcttt gcagtaaaaa 900 
aaaaaaa 907 



<210> 44 

<211> 1104 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte Identification No. : 2764755CB1 



<400> 44 

cttcgtttag gtcggctgga aattatgtcc tccgtcggtt ttccgcagtt tttccaccaa 60 
gcgagatatt tttgggagtt attccctaaa taactgcatt atatgctcct ttcatgacga 120 
aattgctgcc gtggagaaga ctggaggaaa ctcgaggaag agggagaagc cgacaagtgc 180 
tcgacgggct aggaactgtc ctgcttgggt gttagcgttt cccgccgggc cagtaaggct 240 
gagtgacccg gcgtggctac taggagaagg acgtacggtc ctgctagtag aggaatatgt 300 
cgagtttctc tagggcgccc cagcaatggg ccacttttgc tagaatatgg tatctcttag 360 
atgggaaaat gcagccacct ggcaaacttg ctgctatggc atctataaga cttcagggat 420 
tacataaacc tgtgtaccat gcactgagtg actgtgggga tcatgttgtt ataatgaaca 480 
caagacacat tgcattttct ggaaacaaat gggaacaaaa agtatactct tcgcatactg 540 
gctacccagg tggatttaga caagtaacag ctgctcagct tcacctgagg gatccagtgg 600 
caattgtaaa actagctatt tatggcatgc tgccaaaaaa ccttcacaga agaacaatga 660 
tggaaaggtt gcatcttttt ccagatgagt atattccaga agatattctt aagaatttag 720 
tagaggagct tcctcaacca cgaaaaatac ctaaacgtct agatgagtac acacaagaag 780 
aaatagacgc cttcccaaga ttgtggactc cacctgaaga ttatcggcta taagagaata 840 
agaattgcag aaaataacag tgaagtgatt gaaactttct tctgatgagt ttctctaacc 900 
tacaggatgg agtaaaacaa ctgctacagt tcagcacctg ttttatgtgc cgaatcactg 960 
tggggaaagg tcaggaaggt gtagtccttc aataggaaat tgtaattaaa atataatttt 1020 
atagaaccat ttttatgtaa tctgatttga atgttatagt tgataataat aaaatcactt 1080 
acttggttga ctaaaaaaaa aaaa 1104 



<210> 45 
<211> 910 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte Identification No. : 2875939CB1 
<400> 45 

cccacgcgtc cgcgattcat agctcgcagg gtacgggcgc gcgtgcgcac tccgcagccc 60 
gttcaggacc ccggcgcggg cagggcgccc acgagctggc tggctgcttg cacccacatc 120 
cttctttctc tgggacctgg ggtcgcggtt acttgggctg gccggcgaac ccttgagtgg 180 
cctggcgggg agcgggcctc gcgcgcctgg ^9'99ccc:tgt ggaacgaaga gaggcacaca 240 
gcatggcaga aaaccgagag ccccgcggtg ctgtggaggc tgaactggat ccagtggaat 300 
acacccttag gaaaaggctt cccagccgcc tgccccggag acccaatgac atttatgtca 360 
acatgaagac ggactttaag gcccagctgg cccgctgcca gaagctgctg gacggagggg 420 
cccggggtca gaacgcgtgc tctgagatct acattcacgg cttgggcctg gccatcaacc 480 
gcgccatcaa catcgcgctg cagctgcagg <=:gggcsgctt cgggtccttg caggtggctg 540 
ccaatacctc caccgtggag cttgttgatg agctggagcc agagaccgac acacgggagc 600 
cactgactcg gatccgcaac aactcagcca tccacatccg agtcttcagg gtcacaccca 660 
agtaattgaa aagacactcc tccacttatc ccctccgtga tatggctctt cgcatgctga 720 
gtactggacc tcggaccaga gccatgtaag aaaaggcctg ttccctggaa gcccaaagga 780 
ctctgcattg agggtggggg taattgtctc ttggtgggcc cagttagtgg gccttcctga 840 
gtgtgtgtat gcggtctgta actattgcca tataaataaa aaatcctgtt gcactagtaa 900 
aaaaaaaaaa 910 



<210> 46 
<211> 733 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte Identification No. : 3591363CB1 



<400> 46 

gttcgcgtcg tttccgtttc cggccgaggc tgcgggaaga tggcggcggc catggcagca 60 
tcttccctga cggtcacctt ^gggcggctg gcgtccgcgt gcagccacag catcctgaga 120 
ccttcggggc ccggagcagc ctccctttgg tctgcttctc gaaggttcaa ttcacagagc 180 
acttcatatc taccaggata tgttcctaaa acatccctga gttcaccacc ttggccagaa 240 
gttgttctgc cagacccagt tgaggagacc agacaccatg cagaggtcgt gaagaaggtg 300 
aatgagatga tcgtcacggg gcagtatggc aggctctttg gcgtggtgca ctttgccagc 360 
cgccagtgga ^ggtgacctc tgaagacctg atcttaattg gaaatgaact agaccttgcg 420 
tgtggagaga gaattcgact ggagaaggtc ctgctggttg gggcagacaa cttcacgctg 480 
cttggcaagc cactcctcgg aaaggatctt gttcgagtag aagccacagt cattgaaaag 540 
acagaatcat ggccaagaat cattatgaga ttcaggaaaa ggaaaaactt caagaagaaa 600 
agaatcgtca cgaccccgca gactgtcctc cggataaaca gcattgagat tgctccgtgt 660 
ttgttgtgat taccgagtta atacttacaa aaggataaaa ataaactcct gcttcccaag 720 
gaaaaaaaaa aaa 733 



<210> 47 
<211> 918 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte Identification No. : 3702292CB1 



<400> 47 

cgcatgcggc cgagtgcggg actggggctc ctggctgtgg gtgtggtacc gaggcttcag 60 
cgggtgccgc ccgcctagag ggagtggagc ggtgagcacg tcaggggtgg ggggcgcagg 120 
tcaagctttc accagttttt aattctttga tggggtaaat ttgagcaatt ttctcgactt 180 
gtcgacattc gttattaact gagcaggaat caggagagga acccggtcct ctccacacag 240 
cccagcagag agcctacgac tagatttgca tctttacgtc ctgcgcggag gctgctacac 300 
acatgcagaa gtcatgctgg tggcctggac agtgaaggga gagaagtgga tttgggagac 360 
atttaggaga tggcaccaaa agcgaaggaa gctcctgctc atcctaaagc cgaagccaaa 420 
gcgaaggctt taaaggccaa gaaggcagtg ttgaaaggtg tccgcagcca cacgcaaaaa 480 
cagaagatcc gcatgtcact caccttcagg cggcccaaga cactgcgact ccggaggcag 540 
cccagatatc ctcggaagag cacccccagg agaaacaagc ttggccacta tgctatcatc 600 
aagtttccgc tggccactga gtcggccgtg aagaagatag aagaaaacaa cacgcttgtg 660 
ttcactgtgg atgttaaagc caacaagcac cagatcagac aggctgtgaa gaagctctat 720 
gacagtgatg tggccaaggt caccaccctg atttgtcctg ataaggagaa caaggcatat 780 
gttcgacttg ctcctgatta tgatgctttc gatgttgtaa caaaattggg atcacctaaa 840 
ctgagtccag ctggctaact ctaaatatat gtgtatcttt tcagcataaa aaaataatgt 900 
ttttcataaa aaaaaaaa 918 



<210> 48 

<211> 2680 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte Identification No. : 3778908CB1 



<400> 48 

agtggaactg gatcgggttt gctgccagcg gcgtgagctt cggccgccat tttacaacag 60 
ctccactcgc gccggacaca gggagcagcg agcacgcgtt tcccgcaacc cgataccatc 120 
ggacaggatt tctccgcctc agcccaacgg ggagatctct ggaaacatgg ctacagaaca 180 
tgttaatgga aatggtactg aagagcccat ggatactact tctgcagtta tccattcaga 240 
aaattttcag acattgcttg atgctggttt accacagaaa gttgctgaaa aactagatga 300 
aatttacgtt gcagggctag ttgcacatag tgatttagat gaaagagcta ttgaagcttt 360 
aaaagaattc aatgaagacg gtgcattggc agttcttcaa cagtttaaag acagtgatct 420 
ctctcatgtt cagaacaaaa gtgccttttt atgtggagtc atgaagactt acaggcagag 480 
agaaaaacaa gggaccaaag tagcagattc tagtaaagga ccagatgagg caaaaattaa 540 
ggcactcttg gaaagaacag gctacacact tgatgtgacc actggacaga ggaagtatgg 600 
aggaccacct ccagattccg tttattcagg tcagcagcct tctgttggca ctgagatatt 660 
tgtgggaaag atcccaagag atctatttga ggatgaactt gttccattat ttgagaaagc 720 
tggacctata tgggatcttc gtctaatgat ggatccactc actggtctca atagaggtta 780 
tgcgtttgtc actttttgta caaaagaagc agctcaggag gctgttaaac tgtataataa 840 
tcatgaaatt cgttctggaa aacatattgg tgtctgcatc tcagttgcca acaataggct 900 
ttcngcgggc tctattccta agagtaaaac caaggaacag attcttgaag aatttagcaa 960 
agtaacagag ggtcttacag acgtcatttt ataccaccaa ccggatgaca agaaaaaaaa 1020 
cagaggcttt tgctttcttg aatatgaaga tcacaaaaca gctgcccagg caaggcgtag 1080 
gttaatgagt ggtaaagtca aggtctgggg gaatgttgga actgttgaat gggctgatcc 1140 
tatagaagat cctgatcctg aggttatggc aaaggtaaaa gtgctgtttg tacgcaacct 1200 
tgccaatact gtaacagaag agattttaga aaaggcattt agtcagtttg ggaaactgga 1260 
acgagtgaag aagttaaaag attatgcgtt cattcatttt gatgagcgag atggtgctgt 1320 
caaggctatg gaagaaatga atggcaaaga cttggaggga gaaaatattg aaattgtttt 1380 
tgccaagcca ccagatcaga aaaggaaaga aagaaaagct cagaggcaag cagcaaaaaa 1440 
tcaaatgtat gacgattact actattatgg tccacctcat atgccccctc caacaagagg 1500 
tcgagggcgt ggaggtagag gtggttatgg atatcctcca gattattatg gatatgaaga 1560 
ttattatgat tattatggtt atgattacca taactatcgt ggtggatatg aagatccata 1620 
ctatggttat gaagattttc aagttggagc tagaggaagg ggtggtagag gagcaagggg 1680 
tgctgctcca tccagaggtc gtggggctgc tcctccccgc ggtagagccg gttattcaca 1740 
gagaggaggt cctggatcag caagaggcgt tcgaggtgcg agaggaggtg cccaacaaca 1800 
aagaggccgc ggggtacgtg gtgcgagggg tggccgcggt ggaaatgtag gaggaaagcg 1860 
caaagctgat gggtacaacc agccagattc caagcggcgc cagaccaata atcagaactg 1920 
gggctcccaa cccattgctc agcaaccgct ccaaggtggt gatcattctg gtaactatgg 1980 
ttacaaatct gaaaaccagg agttttatca ggatactttt gggcaacagt ggaagtagaa 2040 
acagtagggc ctctgtaaaa ttggagactg ataggttgat cagaaactca ccctaaatct 2100 
gaacgggtgc cgctataatt tgtgacatct ggcaagattt ccctttatgt atatatttta 2160 
acaatccgct tggacacgaa caaagccaca cttctaactg cttctggcga actgatttta 2220 
tttttaattt ttttcaataa agatattctt agatactgaa agaaatagtt aatgagtttg 2280 
catttgtgct tgagaaaatt tggctcaagt ccatttggct gtagtgtcaa cgatgtttcc 2340 
agtagtgttt agatttggtg tcttcaaagg tagttgatta aaaccaagtg tgtctttaat 2400 
atcttgtatc agaataactt tgtatgttac caacttaaat tgctagaata aggtaaattg 2460 
atacacaact gctattttta atttagaact ttgacctaat ttgggttttc aaaaccattt 2520 
tggctacttg tattctttat gctgttgttt atttcaataa aaaattcaca cctaaatgta 2580 
tacttactaa aattgtgttt acaattcgtt tttcacaaaa tttcctgcaa atttggttca 2640 
aattgtatag catgtcaagg ccaattaaag ggttttgtga 2680 



<210> 49 

<211> 2568 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte Identification No. : 4163642CB1 
<400> 49 

ccaggctgcg ccagacagtg tagaacctgc ggcctcgatg tccttctccc gtgccctatt 60 
gtgggctcgg ctcccggcgg ggcgccaggc tggccaccgg gcagccatct gctctgccct 120 
tcgtccccac tttgggccct ttcccggggt tctggggcaa gtttctgtcc ttgccaccgc 180 
ctcctcctct gcctccggtg gctccaaaat accaaacacg tccttgttcg tgcccctgac 240 
tgtgaaacct cagggcccca gcgccgacgg cgacgtcggg gccgagctaa cccggcctct 300 
ggacaagaat gaagtaaaga aggtcttaga caaattttac aagaggaaag aaattcagaa 360 
actgggtgct gattatggac ttgatgctcg tctcttccac caagctttca taagctttag 420 
aaattatatt atgcagtctc attccctgga tgtggacatt cacattgttt tgaatgatat 480 
ttgcttcggt gcagctcatg cggatgattt attcccattt ttcttgagac atgccaaaca 540 
aatatttcct gtgttggact gtaaggatga tctacgtaaa atcagtgact taagaatacc 600 
acctaactgg tacccagatg ctagagccat gcagcggaag ataatatttc attcaggccc 660 
cacaaacagt ggaaagactt atcacgcaat ccagaaatac ttctcagcaa agtctggagt 720 
gtattgtggc cctctaaaat tactggcaca tgagatcttc gaaaagagta atgctgctgg 780 
tgtgccatgt gacttggtga caggtgaaga gcgtgtgaca gttcagccaa atgggaaaca 840 
ggcttcacat gtttcttgta cagttgagat gtgcagtgtt acaactcctt atgaagtggc 900 
tgtaattgat gaaattcaaa tgattagaga tccagccaga ggatgggcct ggaccagagc 960 
acttctagga ctgtgtgctg aagaggttca tttgtgtgga gaacctgctg ctattgacct 1020 
ggtgatggag cttatgtaca caacggggga ggaagtggag gttcgagact ataagaggct 1080 
tacccccatt tctgtgctgg accatgcact agaatcttta gataaccttc ggcctgggga 1140 
ctgcattgtc tgttttagca agaatgatat ttattctgtg agtcggcaga ttgaaattcg 1200 
gggattagaa tcagctgtta tatatggcag tctcccacct gggaccaaac ttgctcaagc 1260 
aaaaaagttt aatgatccca atgacccatg caaaatcttg gttgctacag atgcaattgg 1320 
catgggactt aatttgagca taaggagaat tattttttac tcccttataa agcccagtat 1380 
caatgaaaag ggagagagag aactagaacc aatcacaacc tctcaagccc tgcagattgc 1440 
tggcagagct ggcagattca gctcacggtt taaagaagga gaggttacaa caatgaatca 1500 
tgaagatctc agtttattaa aggaaatttt gaagaggcct gtggatccta taagggcagc 1560 
tggtcttcat ccaactgctg agcagattga aatgtttgcc taccatctcc ctgatgcaac 1620 
actgtccaat ctcattgata tttttgtaga cttttcacaa gttgatgggc agtattttgt 1680 
ctgcaatatg gatgatttta aattttctgc agagttgatc cagcatattc cactaagtct 1740 
gcgagtgagg tatgttttct gcacagctcc tatcaacaag aagcagcctt ttgtgtgttc 1800 
ttcactgtta cagtttgcca ggcagtatag caggaatgag cccctgacct ttgcatggtt 1860 
acgccgatac atcaaatggc ctttacttcc acctaagaat attaaagacc tcatggatct 1920 
tgaagctgtc cacgatgtct tggatcttta cttgtggcta agctaccgat ttatggatat 1980 
gtttccagat gccagcctta ttcgagatct ccagaaagaa ctagatggta ttatccaaga 2040 
tggtgtgcac aatatcacta aattgattaa aatgtctgag acgcataagc tgttgaattt 2100 
ggagggcttt ccatcaggga gccagtcacg attgtcagga accttaaaga gccaagctag 2160 
aaggacacgc ggcaccaaag ctctagggag taaagctact gagccaccca gccccgatgc 2220 
aggagagctg tcccttgctt ccagattggt gcagcaagga ctcctcactc cagacatgct 2280 
gaaacagcta gaaaaagagt ggatgacaca acaaactgaa cacaacaaag aaaaaacaga 2340 
gtctgggact catccaaaag ggacgagaag aaagaagaag gaacctgatt cggactagtt 2400 
ttctgttcct gttttttttt tttatttaat tttgcaaata aaaatttatt ttgaatcctt 2460 
tttcctcata tgcatttact ccctcctcta gtattgtggc tatctggtac tgggggattt 2520 
ttggtgtgtg tgtgtgtttg tgtgtgtgtt tttttgtttg ttttttcc 2568 



<210> 50 
<211> 847 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte Identification No. : 4906154CB1 
<400> 50 

caaatggcgg atgacgccgg tgcagcgggg gggcccgggg gccctggtgg ccctgggatg 60 
gggccggagg aaggccgagg ataaggagtg gatgcccgtc accaagttgg gccgcttggt 120 
caaggacatg aagatcaagt ccctggagga gatctatctc ttctccctgc ccattaagga 180 
atcagagatc attgatttct tcctgggggc ctctctcaag gatgaggttt tgaagattat 240 
gccagtgcag aagcagaccc gtgccggcca gcgcaccagg ttcaaggcat ttgttgctat 300 
cggggactac aatggccacg tcggtctggg tgttaagtgc tccaaggagg tggccaccgc 360 
catccgtggg gccatcatcc tggccaagct ctccatcgtc cccgtgcgca gaggctactg 420 
ggggaacaag atcggcaagc cccacactgt cccttgcaag gtgacaggcc gctgcggctc 480 
tgtgctggta cgcctcatcc ctgcacccag gggcactggc atcgtctccg cacctgtgcc 540 
taagaagctg ctcatgatgg ctggtatcga tgactgctac acctcagccc ggggctgcac 600 
tgccaccctg ggcaacttcg ccaaggccac ctttgatgcc atttctaaga cctacagcta 660 
cctgaccccc gacctctgga aggagactgt attcaccaag tctccctatc aggagttcac 720 
tgaccacctc gtcaagaccc acaccagagt ctccgtgcag cggactcagg ctccagctgt 780 
ggctacaaca tagggttttt atacaagaaa aataaagtga attaagcgtg ttaaaaaaaa 840 
aaaaaaa 847 